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his  doctorate  in  mathematics  at  Princeton  University  in  1924. 
As  a  graduate  student  he  taught  mathematics  at  both  Washing- 
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in  this  position  until  1931.  In  1929  he  spent  several  months  at 
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Congress,  and  the  Institute  of  Mathematical  Statistics.  He 
served  as  vice  president  of  the  American  Statistical  Association 
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at  the  important  contributions  that  Professor  Hotelling  has 
made  to  the  mathematical  and  social  sciences.  Few  men  can 
match  his  mastery  of  mathematics,  statistics,  and  economics, 
and  fewer  yet  can  match  his  record  of  major  contributions  in  all 
three  fields.  In  the  field  of  economics,  his  contributions  to  the 
theory  of  utility  and  demand,  welfare  economics,  and  the 
incidence  of  taxation  have  already  assumed  the  stature  of 
classics.  In  addition,  his  contributions  to  statistics  have  found 
applications  in  econometrics  as  well  as  in  other  areas  of  the 
physical,  biological,  and  social  sciences. 

Professor  Hotelling's  pioneering  work  in  the  teaching  of 
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University  and  the  University  of  North  Carolina.  The  generations 
of  graduate  students  who  have  worked  with  him  also  indicate 
the  major  role  he  has  played  in  developing  mathematical 
statistics  and  mathematical  economics  in  the  United  States. 

The  essays  in  this  volume,  dealing  with  topics  in  mathematical 
economics  and  econometrics,  are  offered  by  the  authors  as 
indications  of  their  respect  for  the  contributions  that  Harold 
Hotelling  has  made  to  economics.  The  authors,  some  of  them 


HAROLD    HOTELLING 


IX 
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PART  I 

On  the  Mathematics  of  Optimization 


PAUL  A.  SAMUELSON 


Structure  of  a  Minimum  Equilibrium  System 


1.  The  functions  met  with  in  classical  thermodynamics  possess 
an  abstract  structure  shared  by  equilibrium  systems  in  quite 
diverse  other  fields  —  e.g.,  statics,  economic  theory,  the  recently 
developed  mathematical  theory  of  games  and  linear  programming, 
etc.1)  Given  a  set  of  numerical  data  on  punch  cards  purporting 
to  come  from  an  equilibrium  system  sharing  these  properties, 
we  should  be  able  to  determine  whether  such  a  supposition  is 
tenable.  Once  the  full  empirical  implications  of  the  hypotheses 
embodied  in  the  equilibrium  theories  have  been  elucidated,  we 
can,  if  we  wish,  dispense  with  that  theory  except  for  mnemonic 
purposes. 

By  use  of  matrix  notation,  the  present  paper  gives  a  brief 
but  exhaustive  summary  of  the  properties  of  such  systems, 
confining  itself  for  simplicity  to  the  regular  case  of  unique  and 
different iable  functions.  Then  it  develops  an  exhaustive  set  of 
conditions  that  any  finite  set  of  observations  must  meet  if  the 
theory  in  question  is  not  to  be  refuted. 

2.  Imagine  that  we  are  given  a  set  of  2n  variables  that  come 
in  conjugate  pairs  (xifyt).  (In  phenomenological  thermodynamics 
these  might  be  designated  as  generalized  coordinates  and 
generalized  forces,  such  as  entropy  and  temperature,  volume  and 
negative  pressure,  chemical  mass  and  chemical  potential,  etc.)2) 

x)  For  the  thermodynamic  and  economic  interpretation  of  the  mathematical 
relations  here  analyzed  see  such  references  as  E.  A.  Guggenheim,  Thermodynamics 
(Amsterdam,  1950);  L.  Tisza,  "On  the  General  Theory  of  Phase  Transitions," 
in  Phase  Transformations  in  Solids  (Wiley,  1951,  N.Y.),  pp.  1  —  37;  P.  A. 
Samuelson,  Foundations  of  Economic  Analysis  (Cambridge,  Mass.  1947),  Ch.  3. 

2)  To  avoid  indeterminacy  of  scale,  all  extensive  quantities  can  be  thought 
of  as  expressed  in  ratio  to  the  amount  of  an  extensive  variable,  xn+v  which 
is  not  included  in  this  analysis. 
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2  ESSAYS  IN  ECONOMICS  AND  ECONOMETRICS 

There  is  assumed  to  exist  n  equilibrium  conditions  or  equations 
of  state  between  these  2n  variables,  defined  as  follows: 

Axiom  for  a  Regular  Equilibrium  System:  Corresponding 
to  prescribed  values  for  n  variables  (yv  .  .  .,  yn),  values  of  the 
conjugate  variables  (xv  .  .  .,  xn)  are  determined  so  as  to 
provide  a  regular  minimum  to  F(x;  y)  =  F(xv  .  .  .,  xn; 
Vi>  •  •  •>  Vn)  =  Y{xv  .  .  .,  xn)  -  2?  ViVa  so  that 

(1)  dFldxi  =  0  =  Yi(zv...,xu)-yi     (*  =  1 ft) 

where  subscripts  to  functions  always  stand  for  partial  deriva- 
tives with  respect  to  the  indicated  variables,  and  where  the 
Hessian  matrix  of  second  derivatives  of  F  and  Y  with  respect 
to  the  x's  is  everywhere  defined  and  is  positive  definite  so  as 
to  insure  a  regular  minimum. 

Knowledge  of  the  function  Y  above  determines  all  properties 
of  the  equilibrium  system.  (In  thermodynamics,  Y  is  internal 
energy.)  These  regularity  conditions  of  smoothness  and  con- 
vexity could  be  relaxed.  But  for  the  present  it  is  sufficient  to 
deduce  the  full  implications  of  the  regular  case.  We  have  im- 
mediately from  the  Axiom: 

Theorem  1:  Functions  constraining  a  set  of  variables  are 
equivalent  to  a  regular  equilibrium  system,  if  and  only  if, 
they  can  be  thrown  into  the  form: 

(2)  xt  =  Xi(yv  .  .  .,  yn)  eee  Xt(yv  .  .  .,  yn)     (i  =  1,  .  .  .,  n), 

where  X(yv  .  .  .,  yn)  is  an  existent  twice  differentiable  func- 
tion, whose  Hessian  matrix  of  partial  derivatives  —  which  is 
also  the  symmetric  Jacobian  of  the  transformation  (2)  —  must 
be  positive  definite. 

The  necessity  part  of  this  theorem  follows  immediately  from 
the  fact  that  the  relations  (1)  can  always  be  inverted  by  virtue 
of  the  fact  that  their  Jacobian  is  postulated  to  be  the  non- 
singular  matrix  of  a  positive  definite  quadratic  form.  Equations 
(2)  represent  such  inverse  functions,  and  their  Jacobian 
[Xfl  =  [Y^Y1-  Since  the  inverse  of  a  symmetric  matrix  must  be 
symmetric,  our  right  to  set  X*  =  X{  is  established  and  the 
integrability  or  symmetry  conditions  guarantee  the  existence 
of   the    function   X(ylt  .  .  .,  yn)  =  ^  ^  Xidxi  independently   of 
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path,  which  is  unique  up  to  an  inessential  integration  constant. 
Finally,  because  the  inverse  of  a  positive  definite  matrix  must 
be  positive  definite,  this  stipulated  property  is  indeed  necessary. 

The  sufficiency  part  of  the  theorem  can  be  most  easily  seen 
by  recognizing  the  dualism  between  the  x's  and  y's.  If  we 
interchanged  the  letters  in  (2),  we  should  have  a  relation  with 
all  the  properties  of  (1);  call  it  (1)*  and  call  (2)*  the  relation 
(1)  with  the  labels  interchanged.  Then  by  the  just  proved 
necessary  condition,  it  follows  that  (1)*  implies  (2)*,  which  is 
all  that  we  need  for  our  sufficiency  proof  of  Theorem  1.  This 
also  shows  the  truth  of 

Corollary    1:   The   variables    (xlt  .  .  .,  xn)    and    (ylt  .  .  .,  yn) 

are  mutually  conjugate:  the  same  equilibrium  system  can  be 

defined  by  the  dual  minimum  problem. 

F*(y\  x)  =  X(yv  .  .  ,,  yn)  —  ^  x^iy    a    minimum    for    pre- 
scribed x's. 

3.  The  equilibrium  conditions  of  (1),  or  (2),  were  seen  to  be 
n  constraints  among  2n  variables.  There  are  a  vast  number  of 
ways  of  taking  as  prescribable  variables  n  out  of  2n  variables, 
namely  (2n)\j(n\)2  permutations.  But  in  the  general  case,  there 
is  no  reason  to  believe  that  any  n  variables  taken  at  random 
can  be  independently  varied.  Thus,  in  the  simple  case  where 
Y(xv  x2,  .  .  .)  =  j&tf,  *ne  equations  of  state  are  x{  =  yit  and 
the  Hessians  of  Y  and  of  X  are  the  identity  matrixes.  Clearly 
for  this  system,  the  variables  (xv  yv  xs,  .  .  .)  cannot  be  indepen- 
dently varied. 

In  general,  the  only  admissible  sets  that  we  can  be  sure  of  are 
those  generated  by  the  following  rule:  Select  one  and  only  one 
member  from  each  conjugate  pair  of  variables.  Since  there  are 
two  independent  ways  of  selecting  from  each  pair,  there  are 
obviously  exactly  2n  such  admissible  sets  of  independently 
prescribable  variables.  If  we  follow  the  convention  of  suitably 
renumbering  the  variables,  we  can  write  any  of  the  2n  sets  in 
one  of  the  following  n  -j-  1  forms: 

(Vl>  •  •  •>  V«),     (Vl>  •  •  •>  yn-l>    *n).  •  •  ■»   iVv  •  •  •»  Vm>  Xm+1>  •  •  ■»  Xn)> 

.  .  .,  (y^  x2,  .  .  .,xn),   {xv  .  .  .,xn).- 
The  first  and  last  of  these  are  the  "unmixed"  sets  of  (1)  and  (2). 
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That  the  remaining  "mixed"  sets  are  admissible  as  independently 
prescribable  follows  from  the  fact  that  every  principal  minor 
of  the  definite  Jacobians  of  (1)  and  (2)  are  necessarily  definite 
and  non-singular.  This  assures  us  that  the  first  m  equations  of 
(1)    can    be    solved    explicitly    for    (xv  .  .  .,  xm)    in    terms    of 


(2/i,  •  •  •>  Vr 


'm+V 


.  .,  xn)   since  the  crucial  matrix  of  partial 


derivatives  [dyjdx^  is  the  non-singular  m  rowed  principal 
minor  of  [Yw];  with  these  x's  solved  for,  we  can  substitute  in 
the  remaining  n  —  m  equations  of  (1)  to  get  (ym+1,  .  .  .,  yn)  in 
terms  of  the  same  independent  variables.  (A  dual  similar  ar- 
gument could  be  made  concerning  the  solvability  for  (ym+1, .  ..,yn) 
of  the  last  n  —  m  equations  of  (2).)  Thus,  by  use  of  the  Implicit 
Function  Theorem  —  and  by  an  extension  of  it  that  gives  no- 
where vanishing  principal  minors  as  sufficient  conditions  for 
uniqueness  of  our  solutions  in  the  large  —  we  can  deduce. 

Theorem  2:  For  any  of  the  2n  admissible  sets  of  n  independent 
variables,  consisting  of  one  and  only  one  member  of  each  pair 
of  conjugate  variables,  we  can  in  regular  systems  —  and  only 
in  such  systems  —  determine  unique  and  differentiable  values  of 
the  remaining  variables.  After  we  have  suitably  renumbered 
the  variables,  we  can  write  these  relations  as 


(3) 


n), 


0  ^m  f^  n 
with  the  partitioned  Jacobian  matrix  of  skew-symmetric  form 

'dx  dx~ 

y-l  —Y^Y 


J  m 


(±) 


By  dx 
cfy  dy 
dy  dx. 


[■*■    XX        >  -   XX    -   0503 

Y-  Y-1    Y—Y-  Y~XY 


xx  ■*■  XX       xx-1 


-Y 


Y  - 

XX    M   XX 


V 

yy    ,VB 


Xyy 


**~yy  **-vV         >  yy      -*         *-  yy  ' 

r  a  bi 

=  ,     0  <  m  <  n, 

l-B'CJ         ~      ~ 

Yxx  being  understood  to  stand  for  the  first  m  by  m  rows  and 
columns  of  [Ytj], . . .,  Xm  for  the  last  n  —  m  rows  and  columns 
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of  [Xit]t  etc.;  and  the  symmetric  diagonal  blocks  A   and  C 

being  positive  definite.3) 

In  (3)  the  position  of  the  semi-colon  will  serve  to  identify 
the  value  of  m,  except  in  the  limiting  cases  where  m  =  0  or  n, 
in  which  case  we  have  the  unmixed  variables  and  equations  of 
(1)  and  (2).  To  prove  the  various  relations  of  (4),  we  make  the 
appropriate  matrix  substitutions  (or  matrix  premultiplica- 
tions)  in  the  differentials  relation  of  (1)  and  (2) 


'YxxYxx 

Y-  Y- 

■i  xx  •*■  xx 


(S) 


dx 

0    -In- J 

dx 

dy 

Jy- 

-    =0, 


dx 
dx 
dy 
dy. 


to  get  the  first  row's  relations  of  (4),  which  are  seen  to  be  dual 
forms.  The  next  row  of  (4)  immediately  follows.  The  last  form 
is  important:  it  summarizes  the  facts  that  the  diagonal  blocks 
are  symmetrical,  but  that  the  off-diagonal  matrixes  are  skew- 
symmetric  with  (dx/dx)  =  —  (dy/dy)'.  These  properties 
follow  from  the  symmetry  relations  [Yw]  =  [YH],  while  the 
definiteness  of  the  positive  blocks  follows  from  the  definiteness 
of  [Y{j]  and  [Xtj]  and  of  the  inverses  of  their  principal  minors. 
To  prove  the  converse  part  of  the  theorem  —  that  the  structure 
of  (3)  and  (4)  implies  the  structure  of  (1)  or  (2)  —  note  that 
by  the  same  routine  substitutions  by  which  (5)  went  into  (4), 
but  working  in  reverse,  we  find  for  the  Jacobian  of  (y,  y)  in 
terms  of  (x,  x) 


3)  If  determinacy  of  scale  has  not  been  achieved  by  the  device  of  the  first 
footnote,  then  Y(xlt  .  .  .,  xn)  will  be  a  homogeneous  function  of  order  one,  and 
its  n2  Hessian  [Yt7]  will  be  positive  definite  of  rank  n  —  1.  The  functions 
(2)  will  then  be  determinate  only  in  ratio  and  not  in  scale.  For  m  <  n,  the 
functions  (3)  will  be  perfectly  determinate,  and  can  be  shown  to  be  homo- 
geneous of  order  one  in  [xm+1,  .  .  .,  xn]  alone,  requiring  C  to  be  positive  semi- 
definite  of  rank  (n  —  m  —  1)  with  [xm+1,  .  .  .,  xn]  C  =  0  and  [xv  .  .  .,  a?„]  = 
fon+1 ««]£'• 
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~dy 

dy~ 

dx 

dx 

dy 

Jdx 

dy 

dx_ 

dx*  dx         r     A-1  —A-^B     1 

dv   dv         L-B'A-1     C+B'A-iB] 


(6) 


This  is  seen  to  be  symmetric.  It  remains  to  show  that  in- 
dependently of  B,  the  positive  definiteness  of  C  and  A  will 
guarantee  that  this  whole  matrix  is  positive  definite,  and  hence 
of  the  stipulated  regular  form  [YH]. 

We  make  use  of  the  easily  demonstrable  algebraic  fact  that 
any  positive  definite  A-1  can  be  written  in  an  infinity  of  dif- 
ferent ways  as  D'D,  where  D  is  non-singular. 

Now  consider  the  quadratic  form 


0  =  01+02  =[*!,•  ••>*„, 

=  LZm+l>  •  '  •>  Zn}^LZm+l> 


+  [*! 


.if        D'D- 

'"J  l-(DB)'D. 


Hi  •  •  ■>  zn\J nlzV  •  ■  ■>  zmi  zm+l>  ■  ■  •>  zni 

•>ZnY 

-D'(DB)  I 
(DB)'D,  (DB)'(DB)} 

=  [<),..  .,  0,  zm+v  .  .  .,  zn]  C[0,  .  .  .,  0,  ^+1,  .  .  .,  zj  +  jnr 

where  W  =  [zv  .  .  .,  zm]D'  -  [zm+1,  .  .  .,  zn](DB)'. 

Q±  is  positive  semi-definite  because  C  was  positive  definite 
by  hypothesis;  Q2  is  a  sum  of  squares  and  hence  positive  semi- 
definite.  But  for  z'z  ^  0,  where  Qx  =  0,  Q2  >  0  and  where 
02  =  0*  0i  >  0;  hence,  Q  is  finally  proved  to  be  positive  definite. 

4.  Returning  now  to  the  mixed  relations  (3),  their  skew- 
symmetry  property  warns  us  that,  as  they  stand,  these  functions 
are  not  partial  derivatives  of  any  existent  parent  function. 
However,  because  of  the  special  skew-symmetry  between  the 
off-diagonal  blocks,  it  is  clear  that  changing  the  algebraic  sign 
of  the  two  upper  blocks  will  give  us  a  symmetric  matrix 

[:*■  ~c] 

which  can  be  regarded  as  the  second-derivative  Hessian  matrix 

of  an  existent  function  for  each  m.  Hence 

Theorem  3:  Any  regular  equilibrium  system  in  2n  conjugate 
variables  has  associated  with  it,  for  each  of  the  2n  admissible 
sets  of  independent  variables,  a  characterizing  function,  which 
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may  be  written 

to    f* 

(8)  w  r 

+  2    y%i, . .., 2/m;  ...itxn)dxt 

m+lJ 

where  the  line  integral  is  independent  of  path  by  virtue  of 
the  symmetry  relations.  In  terms  of  the  relevant  characterizing 
function,  we  can  rewrite  equations  (3) 

yt  =  +dP(yv  ...,ym;  .  .  .,  xn)ldxt,     i  =  m  +  1,  .  .  .,  n. 

Corollary  1:  For  m  =  0,  P=  Y;  and  for  w  =  w,  P=  —X; 
for  all  m  and  for  values  of  the  2n  variables  satisfying  the  n 
equilibrium  conditions, 

to  n 

(9)  P(ylt  .  .  .,  ym;  xm+1,  .  .  .,  sn)  =  Y  —  J0yixj  =  -X+  2Vixi> 

1  TO+l 

but  these  are  not  identities  in  the  2n  variable  space. 

Corollary  2:  For  prescribed  values  of  (xv  . . .,  xm;ym+1>  . .  .,yn) 
the  expression 

to  n 

(10)  F*^,  £;  £,  y)  =  P(fo, . . .,  £m;  £m+1. . .,  *J+2>,jf,— X^ 

1  TO+l 

reaches  a  regular  saddlepoint  or  minimax  when,  and  only 
when,  the  remaining  variables  (yv  .  .  .,  ym;xm+1,  .  .  .,  xn)  are  in 
their  regular  equihbrium  configuration;  i.e.,  for  an  equilibrium 
position  (4,  ...;...,  x°n;  y\}  ...;..  .  jfl) 

(11)  Fw(z°,  £°;  £,  y°)  ^  Fm(z0,  £°;  £°,  y°)  ^  Fw(z°,  £;  £°,  y°) 

with  the  inequaHties  valid  for  distinct  points. 

In  classical  thermodynamics  these  2n  characterizing  func- 
tions have  a  variety  of  notational  descriptions  and  go  by  such 
names  as  internal  energy,  enthropy,  enthalpy,  Helmholtz  free 
energy,  Gibbs  potential  or  free  energy,  etc.  Knowledge  of  any 
one  of  them  provides  a  succinct  summary  of  all  the  regular 
equilibrium  relations,  and  by  use  of  the  transformations  of  (6) 
and  (4)  we  can  in  principle  go  from  the  properties  of  any  one  to 
the  properties  of  any  other. 
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The  symmetry  properties  of  (4)  provides  immediate  proof  of 
the  existence  of  each  of  these  characterizing  functions  described 
in  theorem  3.  To  prove  Corollary  1,  we  may  use  the  properties 
of  Legendre  transformations  or  of  differential  geometry;  but 
remaining  within  the  limits  of  the  present  straightforward 
algebraic  manipulation  of  Jacobian  matrices,  we  make  the 
equilibrium  substitutions  to  get 

m  m 

i  i 

and 

m 

d(Y  -  2ytxt)       m 

~ =  I  Vt-VtX-Xi  =  0-xt,     (i  =  1,  .  .  .,») 

(12)  « 

d(Y  ~  IV,*t)         m 

-^ =  I(Yj-yj)x{+Yi=0+yi,  (i  =  ni+l,...,n) 

dXi  i 

which  is  seen  to  be  identical  with  P  if  the  inessential  integration 
constant  is  appropriately  adjusted.  By  using  the  dualism 
between  Y  and  X,  the  relations  of  P  to  —  (X  —  2^+i  x^^ 
shown  in  (9)  are  verified. 

To  prove  the  saddlepoint  property  of  Corollary  2,  we  note  the 
equivalence  of 

dFm  dFm 

(13)  — -  =0     (*  =  l,:..,m);  -T-r  =0     (*  =  m  +  1, :. ■.  .,  n) 

to  (3)';  and  we  note  that  in  changing  the  algebraic  signs  in  the 
upper  blocks  of  (4),  we  achieve  (7)'s  symmetry  but  at  the  cost 
of  making  its  upper  diagonal  block  — A  =  — Y~l  be  negative 
rather  than  positive  definite.  Together  with  the  positive  defini- 
teness  of  C  =  X~y,  this  is  sufficient  for  the  saddle-point  or 
minimax  property. 

This  minimax  property  shows  us  that  the  notion  of  an  un- 
mixed set  of  conjugate  variables  (yv  .  .  .,  yn)  and  (xv  .  .  .,  xn) 
is  not  arbitrary.  We  are  generally  forced  to  group  yx  and  y2 
together  and  are  not  free  to  interchange  the  x  and  y  designations 
of  yx  and  xv  (Thus,  entropy  and  volume  belong  together  in  a 
sense  that  temperature  and  volume  do  not;  this  we  know  from 
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the  mathematics  as  well  as  from  the  physics.)  For  if  we  link 
up  admissible  sets  of  conjugate  variables  without  distinguishing 
which  are  x's  and  which  are  y's,  the  Jacobian  (4)  of  the  resulting 
transformation  (3)  will  have  a  determinable  number  of  rows  and 
columns  that  yield  skew-symmetry.  Thus,  wis  a  determinable 
number  which  can  be  verified  to  be  0  or  n  or  to  be  in  between. 
This  enables  us  to  find  the  proper  way  of  dividing  the  2n  con- 
jugate variables  into  two  unmixed  groups.  Of  course,  which  we 
call  x  and  y  has  no  mathematical  significance  because  of  the 
complete  dualism  between  them.  (In  the  singular  case  where 
whole  rows  and  corresponding  columns  of  Jm  vanish,  we  shall 
later  show  that  the  exact  size  of  m  is  indeterminable  but  in- 
different, since  the  system  then  splits  off  into  independent 
constellations.) 

5.  Theorem  3  and  its  corollaries  each  contains  within  itself, 
and  is  equivalent  to,  the  previous  theorems  and  Axiom.  If  any 
2n  variables  can  be  labelled  so  that  n  of  them  can  be  expressed 
in  terms  of  the  remaining  n  by  a  transformation  with  the  regular 
minimax  properties  of  (11),  such  a  system  must  have  all  the 
properties  of  a  regular  equilibrium  system. 

For  each  of  the  2W  admissible  sets  of  independent  variables, 
there  will  be  n(n  —  l)/2  symmetry  or  skew-symmetry  relations. 
In  thermodynamics  these  are  called  Maxwell  relations,  and  any 
one  complete  set  of  them  implies  all  the  rest.  Matrix  notation 
and  operations  themselves  constitute  a  most  excellent  mnemonic 
device;  so  the  transformation  of  (4)  and  (6)  represent  an  excellent 
substitute  for  the  Bridgman4)  and  similar  formulas  by  which 
partial  derivatives  of  one  set  can  be  converted  into  partial 
derivatives  of  another.  Given  numerical  coefficients,  we  can 
efficiently  go  from,  say,  J6  to  /9,  by  the  series  of  steps  /6,  J7, 
J8,  /9,  where  at  each  stage  the  only  needed  operations  are  of  the 
Gauss-Doolittle  type  convenient  for  desk  and  other  calculators, 
namely  a  —  be,  and  (a  —  bc)jd.  In  practice,  there  is  never 
need  actually  to  renumber  variables  since  the  four  partitioned 
matrices  can  always  be  represented  by  different  colors  for  the 
numbers  or  by  appropriate  underlining. 

4)  P.  W.  Bridgman,  A  Condensed  Collection  of  Thermodynamic  Formulas 
(Harvard,  1925). 
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We  might  call  a  system  a  "regular  variational  system"  if  it 
satisfies  all  the  symmetry  equalities  of  a  regular  minimum  system, 
whether  or  not  it  also  has  the  definiteness  properties  on  its 
Jacobians  of  a  minimum.  Before  exploring  further  the  inequalities 
implied  by  the  minimum  property  itself,  we  first  give  a  single 
minimum  formulation  that  is  capable  of  handling  all  admissible 
mixed  cases  simultaneously. 

6.  Our  original  axiom  assures  us  that  F(x;  y)  attains  a 
minimum  if  we  set  the  x's  at  their  optimal  equilibrium  values 
in  terms  of  the  y's  as  determined  by  the  equihbrium  relations 
(1)  or  (2).  Thus,  the  minimum  obtainable  of  F  will  depend  only 
on  the  y's  and  we  may  write  it  as  F(y)  ^  F(x;  y),  with  the 
equality  sign  holding  in  equilibrium,  and  only  then.  It  will  be 
easily  shown  that  F(y)=  —X(y)  for  an  appropriate  choice 
of  integration  constant  in  the  latter  dual  function  to  Y(x). 
This  suggests 

Theorem  4:  For  any  regular  equilibrium  system,  we  can 

find   two  mutually  conjugate   functions   Y(xlf  .  .  .,  xn)  and 

xfoi>  •  •  ->2/n)>  sucn  that 

(14)      N(xv . . .,  xn;  yp  . . .,  yn)  =  Y(xv  . . .,  xn) 

+  X(y1,...,yn)-JfXiyi 

i 

reaches  a  regular  minimum  value  of  zero  if,  and  only  if,  the 

2n  variables  are  in  their  n  equilibrium  relations  to  each  other. 

The  Hessian  matrices  of  Y  and  X  will  be  positive  definite 

everywhere   and  in   the   equihbrium   configurations   will  be 

exact  inverses  of  each  other. 

Note  that  this  minimum  formulation  includes  as  special  cases 
the  dual  minimum  problems  of  the  Axiom  and  Corollary  1. 
Note  that  it  is  the  only  formulation  yet  stated  that  treats  both  sides 
of  the  dual  with  perfect  symmetry,  and  the  only  minimum  for- 
mulation that  handles  all  admissible  mixed  sets  of  variables. 
If  the  Y,  X,  and  P  functions  are  called  "potentials,"  then 
N(x;  y),  which  is  defined  in  the  2n  space  and  not  alone  on  the 
^-dimensional  equilibrium  locus,  is  in  the  nature  of  "superfluous 
or  disequilibrium  potential." 

To  prove  theorem   (4),  we  note  that  by  F(y)'s  definition, 
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F(x;  y)  —  F{y)  does  have  the  stipulated  properties  of  N(x;  y). 
It  remains  to  show  that  X(y),  as  previously  defined  from  (2), 
with  its  Hessian  [Xtj]  =  \Yij'\~1,  is  for  an  appropriate  choice 
of  integration  constant  identical  to  —  F(y).  But  note  that  for 
m  =  0,  P(ylt  .  .  .,  yn),  as  defined  already  in  (8),  is  identically 
F(Vv  -  -  •>  yn)>  an(i  as  shown  in  (9), 

dF      dP(yv  .  .     yn) 

—  = - =  -xi     («  =  1, . .  .,  n) 

which  agrees  with  (2)'s  definition  of  X.  Hence,  X(yv  .  .  .,  yn)  = 
—  F{yv.  ..,yn). 

Corollary  1:  The  regular  equilibrium  is  defined  in  terms  of 
any  admissible  set  of  mixed  variables  by 

dN 

—  =0  =  Yt{xv  .  .  .,  xn)  -  yt     (i  =l,...,n) 

(15)  X* 

dN 

=  0  =  Xt{ylt  .  .  .,  yn)  -xt     (i.  ==  m  +  1> .  .  .,  ft) 

where    the    Hessian    of    N    with    respect    to     (xv  .  .  .,  xm; 
Vm+v  •  •  •»  Vn)  is  positive  definite  being  of  the  form 


(16) 


r^1  o  - 


If  we  were  to  prescribe  an  inadmissible  set  of  variables, 
involving  both  an  xt  and  yit  then  we  would  get  —  1  terms  in  the 
off-diagonal  blocks  of  the  Hessian  matrix  (16),  and  we  could 
not  infer  that  it  has  the  positive  definiteness  properties  needed 
for  a  minimum.  Thus,  in  the  case  earlier  mentioned  where 
xt  =  yif  N{x;  y)  =  \Jx*  -  JXiVi  =  H>;-*/,)2  +  ifi£  and 
prescribing  (xlt  yv  y3>  .  .  .,yn)  and  minimizing  with  respect  to 
the  remaining  variables  would  give  N  =  \{xx  —  y-J2,  which 
need  not  vanish.  This  confirms  that  only  2n  sets  of  variables 
are  generally  independently  variable. 

It  will  be  noted  that  (15)  shows  the  first  m  equations  of  (1) 
and  the  last  (n  —  m)  equations  of  (2)  together  to  be  equivalent 
to  (1),  or  to  (2),  or  to  (3).  Transforming  the  differentials  of  (15) 
by  the  usual  methods  will  give  us/TO  in  the  next  to  the  last  form 
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of  (4),  a  form  that  treats  both  aspects  of  the  dual  with  perfect 
symmetry.  It  will  be  noted  that  the  P  functions  defined  in  (9) 
did  not  treat  the  dual  functions  Y  andX  perfectly  symmetrically, 
there  being  a  difference  in  algebraic  sign.  But  we  can  split  N 
up  into  two  parts,  so  that 


N{x;  y) 

m                                         n 

=  (Y-Iyjxi)-{-X+2yixi) 

1                                         m+1 
m                                     n 

(17) 

=  -vr-5'jizl)-iz-5Vtxi) 

1                                  m+l 

(in  equil.)  =  P(yv 

•  •  v^;.1.  .,xn)—P(yi>  •  •  ->ym'>  •  •  ■ 

>Xn) 

=  -?(</!> 

•  •  -i  Vmy  •  •  •>  Xn)   H~  *      \XV  '  '  ''  Xm'>  • 

•  -,y 

where  the  definition  of  the  functions  P*,  mutually  complementary 
to  P,  is  obvious.  Through  the  whole  2n  space,  P  and  P*  are 
independent  of  each  other,  even  though  they  are  complementary 
on  the  ^-dimensional  equilibrium  locus. 

The  minimum  formulation  in  terms  of  N  immediately  confirms 
that  a  set  like  (yv  .  .  .,yn\  xm+1,  .  .  .,  xn)  is  truly  a  mixed  one, 
since  generally  N  —  Jj*  xi  yj  can  be  resolved  into  two  additive 
functions  of  n  variables  each  in  essentially  one  way  only,  as  can 
be  determined  from  the  pattern  of  zeros  in  the  2n  by  2n  Hessian 
matrix  of  N.  The  singular  case  mentioned  earlier,  where  Y(x) 
and  X(y)  can  be  each  written  as  the  sum  of  two  functions 
involving  no  overlapping  variables,  is  instantly  revealed  by  the 
use  of  N(x;  y).  Thus,  if  Y(xv  .  .  .,  xn)  =  R(xv  .  .  .,  xr)  + 
S(xr+1,  .  .  .,  xn),  X  will  be  capable  of  a  similar  split  and  we  are 
really  dealing  with  two  entirely  independent  sub-systems. 
That  being  the  case,  there  is  no  mathematical  way  of  relating 
xx  to  xn  in  any  sense  different  from  the  relating  of  x±  to  yn. 

Finally,  it  may  be  mentioned  that  the  general  minimum 
formulation  in  terms  of  N  remains  valid  in  irregular  cases  where 
some  of  the  functions  have  corners  with  undefined  partial 
derivatives,  and  where  the  equilibrium  relations  may  not  be 
unique  or  continuous.  (Even  the  simplest  case  of  phase  equilibrium 
of  a  solid  and  liquid  provides  an  "irregular"  example  where 
functions  have  a  corner  and  partial  derivatives  are  not  defined.) 

7.  This  exhausts  the  full  empirical  implications  upon  our 
observable  functions.  If  someone  specifies  in  detail  any  given 
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set  of  functions,  we  can  in  principle  compute  various  sets  of 
partial  derivatives,  test  various  of  the  symmetry  relations,  and 
various  of  the  determinant al  conditions  for  definiteness.  However, 
suppose  that  the  functions  specified  are  purely  empirical  and 
not  exactly  represented  by  any  finite  combination  of  known 
mathematical  formulas.  We  have  a  procedure  for  calculating 
the  values  of  the  equilibrium  functions  at  any  point.  Also,  we 
can  approximate  to  various  of  their  derivatives;  however,  unless 
we  are  given  certain  a  priori  bounds  on  the  higher  derivatives 
of  the  function,  no  matter  how  far  we  push  our  computations, 
we  cannot  strictly  infer  the  goodness  of  our  approximations 
and  therefore  cannot,  strictly  speaking,  ever  test  a  symmetry 
relation  with  complete  rigor. 

8.  Fortunately,  in  applied  sciences,  we  usually  have  at  least 
vague  notions  concerning  smoothness  of  higher  derivatives,  and 
we  can  therefore  compute  an  expression  like 

fY*(x1+h1,x2,...)-Y*(x1,x2,...) 

kl  Yi(x1,x2+h2,...)-Yi(x1,x2,...)\ 

K  I 

for  "small"  h±  and  h2,  and  decide  whether  it  is  sufficiently  far 
from   zero  to  refute  the  hypothesis  that  dY2/dx1  =  dY1/dx2. 

9.  J.  R.  Hicks5)  has  derived  an  interesting  generalization  of 
the  above  finite  tests  for  integrability  or  symmetry.  For  this 
purpose,  let  us  suppose  that  the  (Y1,  .  .  .,  Yn)  functions  of  the 
x's  are  indeed  the  partial  derivatives  of  a  function  Y(xv  .  .  .,  xn), 
which  has  continuous  second  partial  derivatives.  Then  we  can 
write  various  Taylor's  expansions  such  as 

Y(x\,  x\, ...)-  Y(xl  x\, .  .  .)  =  £  (x)  -  x^Y^,  xs, . . .)  +  0 

= i  &  -  *$y,&  4  •••)  +  *; 

(19) 

1 

n       -  .     V (a*    a*  \   -U  V.(a>?    ^  \ 

5)   J.  R.  Hicks,  A   Revision  of  Demand  Theory  (Oxford,  1956),  p.  126. 


1 

=  i(*J-a}JY,(44- 

1 

•  ■)  -  K 

~  2(*J  -  *?)  Y'(^'  "*  ' 

■  ■)  +  f,(4  4  • 

•  •) 
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where  (xv  x2,  .  .  .)  or  (x)  represents  a  point  intermediate  between 
(x1)  and  (x2);  where  R2  and  R2  represent  remainder  terms 
involving  second-degree  or  higher  terms  in  [x]  —  x2) ;  and  where 
R3  involves  third-degree  or  higher  terms  in  (x)  —  x2)  because 
R'2  and  —  R2  can  be  shown  to  differ  by  terms  of  higher  than  the 
second  degree. 

Following  Hicks,  for  the  rest  of  this  section  I  omit  all  terms 
of  higher  than  second  degree  and  state  relations  that  the  resulting 
approximations  must  satisfy.  Of  course 

0  =  [Y(^)  -  Y(x2)]  +  [Y{x2)  -  Y(x*)]  +  [Y(x*)  -  Yfr1)] 

(20)    =  i2(^^-2^V))+i(2^^-2^^)+i(K2/}-2^^) 

where  the  equihbrium  substitutions  yi  ==  Yt(xlt  .  .  .,xn)  have 
been  made. 

This  is  a  generalized  integrability  condition  that  any  three 
near-by  points  must  approximate  to.  This  general  Hicks  con- 
dition includes  as  a  special  case  the  above  relation  (18);  to  see 
this  set  (x1)  =  (xlt  x2,  .  .  .),  (x2)  =  (x^h^  x2,  .  .  .),  (a3)  = 
(xv  x2  +  h2,  .  .  .),  and  rearrange  the  resulting  terms  of  (20) 
to  get   (18). 

An  obvious  slight  generalization  of  the  Hicks  reasoning  will 
give  us  conditions  that  any  4,  5,  6,  .  .  .,  or  r  distinct  points  must 
satisfy  as  they  come  close  together. 

Theorem  5:  Any  r  nearby  points  observed  from  a  regular, 

continuously  smooth,  minimum  system  must  satisfy  the  finite 

equality 


0  =  2  (Y{xk)  -  Yix^1))  +  (Y(xr)  -  Yix1)) 

Jc=l 

(2i)  =  i  ai{xf-xf+i)(yf+yri))+  iife-*j)w+i£)+*i 


=  1(  &,y*-2$y)) + (I^-I^f) + 

l  l 


l 

where  i?3  involves  terms  of  at  least  the  third  degree  in  (x^—xf). 
By  setting   (x1),   (x2),   (x3),   (#4)  equal  to     (xv  x2,  .  .  .), 
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(xx  +  hv  X2,  .  .  .),  (xv  x2  +  h2,  .  .  .),  («i  +  ^i',  ^2>  •  ■  •)>  we  can 
convert  the  above  into  the  equality  that  must  hold  in  the  limit 
between  the  various  more  familiar  finite  approximations  to 

d3Y  d   (  d*y   \      _d_/d2Y\ 

dx\  dx2      dxx  \dxx  dx2)       dx2  \  dx\  J ' 

and  still  other  special  cases  can  be  derived  involving  chains 
of  any  number  of  observations  in  excess  of  2. 

10.  Equations  (21)  would  be  adequate  tests  for  symmetry 
and  for  the  variational  (as  distinct  from  minimum)  aspect  of 
the  problem  were  it  not  for  the  fundamental  methodological 
difficulty  already  mentioned  in  paragraph  7.  Even  if  we  could 
exactly  calculate  —  free  of  all  statistical  or  experimental  error  — 
any  point  on  the  ^-dimensional  equilibrium  locus  in  the  2n- 
dimensional  space,  nonetheless  we  could  never  be  perfectly 
certain  that  an  observed  value  for  the  bracketed  expression 
in  (18)  did  or  did  not  refute  integrability.  If  the  bracketed  test 
expression  exactly  equals  zero,  we  cannot  be  sure  that  for  still 
closer  together  points  it  will  not  turn  out  to  differ  from  zero. 
If  the  test  differs  from  zero,  we  cannot  be  sure  that  the  difference 
is  greater  than  the  admissible  remainder  term  R%.  Pragmatically, 
we  may  sidestep  this  methodological  difficulty  by  hypothesizing 
a  'priori  knowledge;  or  we  may  not  care  whether  the  equilibrium 
system  is  "really"  variational,  provided  it  behaves  sufficiently 
like  one  for  the  purpose  at  hand. 

The  advantage  of  being  able  to  convert  a  problem  into  variatio- 
nal form  seems  to  be  mainly  mnemonic.  Instead  of  having  to 
record  or  remember  n  independent  functions  Yt(xv  .  .  .,  xn), 
we  need  only  to  know  one  function  Y(xv  .  .  .,  xn).  Or  it  is  enough 
to  know  everywhere  the  one  function  dY(xv  .  .  .,  xn)/dxv  with 
dY(xv  x2,  .  .  .,  xn)jdx2  being  only  known  everywhere  in  the  sub- 
space  where  x  =  xx\  likewise  dY(xv  x2>  x3>  .  .  .,  xn)/dx3  need  be 
known  only  for  varying  (x3,  .  .  .,  xn)  in  the  subspace  where 
(xv  x2)  =  (xv  x2);  .  .  .  finally  dY(xv  .  .  .,  xn_v  xn)/dxn  need  be 
known  only  for  varying  xn  with  the  remaining  variables  fixed. 
(E.g.  in  a  simple  entropy,  volume,  absolute  temperature,  pres- 
sure system,  knowledge  everywhere  of  the  pressure-volume- 
temperature   "equations  of  state"   need  be  supplemented  by 
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knowledge  of  the  specific  heat  function  at  one  temperature 
alone  to  define  internal  energy  and  all  thermodynamic  behavior 
everywhere.) 

Also,  a  general  one-to-one  transformation  of  coordinates, 
as  defined  by  xi  —  /*(»!,  .  .  .,  vn),  will  lead  to  new  variables 
wv  .  .  .,  wn,  conjugate  to  the  v's  and  determined  by  the  identity 
dY  =  1lwsdVl  =  TVi^i  =  Hill  Yi{f1,-,fn)tii^,...,vn)}  dv}. 

11.  It  turns  out,  however,  that  we  can  in  principle  give  an 
exact  test  to  refute  the  hypothesis  that  we  have  a  minimum 
system.  Given  a  set  of  finite  observations,  presumed  free  of 
statistical  error  or  with  known  bounds  on  that  error,  we  can 
specify  exact  inequalities  that  must  be  satisfied.  Testing  an 
approximate  equality  is  in  principle  a  methodologically  different 
problem  from  the  easier  case  of  verifying  or  refuting  an  inequality. 
From  this  viewpoint  it  is  desirable  to  reverse  the  usual  relative 
treatment  of  a  minimum  and  variational  principle.  In  many 
discussions  of  classical  mechanics,  authors  are  satisfied  to  state 
a  variational  principle  without  being  interested  in  whether  the 
resulting  conditions  relate  to  "stationariness"  or  to  the  more 
difficult  question  of  "definite  minimization."  From  the  standpoint 
of  exact  empirical  refutation,  it  turns  out  to  be  conceptually 
much  easier  to  test  the  hypothesis  of  definite  minimization  than 
to  test  the  variational  hypothesis.  To  repeat,  this  is  because  an 
inequality  can  be  empirically  refuted  more  easily  than  can  an 
approximate  equality. 

12.  Let  us  consider  various  distinct  equilibrium  positions 
(x1;  y1),  (x2;  y2),  (x3;  y3),  .  .  ..  Then  by  definition  of  the  regular 
minimum 

1  1 

or 

m 

(22)        Y(x\,  x\,  ...)-  Y(x$,  4  .  .  .)  <  2  (*}  -  a})*}. 

1 

showing  that  R'2  in  (19)  is  necessarily  negative.  Similarly  for  R'2'. 
Hence 


(23)   0  =  \Y{x^)-Y{x*)}+  [Y(^)-F(^)] <2 (*}-*?) (tf-tf); 


MINIMUM   PROPERTIES    OF   EQUILIBRIUM    SYSTEMS  17 

or,  letting  x)  —  x2  =  Axj}  y1.  —  y2  =  Ayjt  we  have  holding 
between  any  two  distinct  equilibrium  states 

(24)  Ax1Ay1  +  Ax2Ay2  +  .  .  .  +  AxnAyn  >  0. 

In  the  special  case  where  only  one  variable  of  any  admissible 
set  of  prescribed  parameters  is  allowed  to  change,  we  get 

(25)  0  +  .  .  .  +  0  +  AxiAyi  +  0  +  .  .  .  +  0  >  0, 
showing  that   conjugate  variables  must   change  in  the  same 
direction  when  no  change  takes  place  in  at  least  one  of  every 
other  pair  or  conjugate  variables.  (This  is  a  weak  form  of  the 
so-called  Le  Chatelier  principle.) 

If  we  have  m  observations,  we  can  apply  the  above  test  (24) 
to  each  of  the  r(r  —  l)/2  pairs  of  observations.  If  any  one  of 
the  inequalities  fails  to  be  satisfied,  then  the  minimum  hypo- 
thesis is  refuted.  However,  if  we  have  more  than  2  observations, 
further  independent  inequalities  must  be  satisfied. 

From  (22),  we  have 

(26)  iy)(x)  -  x*)  +  2  »J(aj  -«?)+£  »}<«$-  *})  >  0 

111 

=  [Y^J-Yfc2)]  +  [Y{x2)  -  Y{x*)]  +  [Y(x*)  -  Yix1)] 

a  test  that  every  triad  of  distinct  points  must  satisfy.  Reversing 
the  order  of  the  points,  we  get  a  second  independent  inequality 
binding  the  same  triad. 

Generally,  for  any  circular  chain  of  distinct  points 
(x1;  y1),  (x2;  y2),  .  .  .,   {xr;  yr),  (sf+1;  yr+1)  =  (x1;  y1),  we  have 
by  similar  reasoning 

(27)  2  [2  y\{x\  -  x^)]  >  0  =  i  [Y(x()  -  Y(xi+1)]. 

t=l    j  i=l 

Reversing  the  order  of  the  points  from  (1,  2,  .  .  .,  r,  1)  to 
1,  r,  .  .  .,  2,  1  gives  a  similar  relation 

r+l  r 

(28)  0  >  ^  EtfW  -  *;-1)]  =  -  III  2//+1H  -  "£")]• 

2        j  1      j 

By  duality,  we  can  infer  relations  just  like  (27)  and  (28), 
such  as 

(29)  2  Q>,%!  -  2/j-1)]  <  0; 
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(29)  can  be  derived  directly  from  (28)  by  rearranging  terms,  and 
hence  does  not  represent  an  independent  condition. 

The  number  of  independent  inequalities  is  equal  to  the  number 
of  distinct  circular  chains  that  can  be  formed  from  the  finite 
given  number  of  observations,  r.  If  r  ==  2,  we  have  a  single 
condition,  namely  (24).  If  r  —  3,  we  can  form  3  independent 
pairings,  and  hence  we  have  three  chains  of  length  two  to 
provide  us  with  inequalities.  In  addition,  we  have  two  in- 
dependent chains  of  length  three;  viz.  123,  321.  So  altogether 
for  r  =  3,  we  have  5  independent  inequalities  to  test. 

Given  any  k  points,  we  can  make  Nk  =  (k  —  1)!  circular 
chains  involving  all  k  elements.  To  prove  this,  note  that  N2  =  1, 
because  12  and  21  are  the  same  circular  chain.  Also,  we  go 
from  a  chain  involving  12  ...  k  —  1  in  some  order  to  one  in- 
volving 12...,  k  —  1,  &  in  some  order  by  putting  between  any 
two  adjacent  numbers  of  the  shorter  chain  the  new  element  k. 
Since  there  are  k  such  places  to  insert  the  new  element,  obviously 
there  are  k  new  chains  of  length  k  for  each  chain  of  length  k  —  1. 
Therefore,  Nk  =  kNk_1}  a  recursion  relation  yielding,  with 
N2  =  1,  the  unique  solution  Nk  =  (k  —  1)! 

The  total  number  of  independent  inequalities  that  any  r 
empirically  observed  points  must  satisfy  is  the  sum  of  all  possible 
chain-inequalities  of  length  k  ^  r,  and  therefore  equals  in  number 


+ 


('2)»,+(;)a'.+...+(;)Jv.=q.+q2+q3i 
+(,:,)  (r-»i+fr-i)i-i(;)(»-i)i 

For  r  =  2,  3,  4,  5,  we  already  have  1,  5,  20,  74  independent 
inequalities  to  be  satisfied,  and  the  number  rises  rapidly  with  r. 
(Incidentally,  there  are  the  same  number  of  Hicksian  equalities, 
of  the  type  given  in  (21)  that  must  be  approximately  satisfied 
for  nearby  points.) 

This  may  all  be  summarized  as  follows. 

Theorem  6:  Any  r  distinct  observations  (x1;  y1),  .  .  .,(xr;  yr) 

arising  from  a  definite  minimum  system  must  satisfy  2£Q 

(^—1) !  independent  inequalities  of  the  type 
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k 

2  [y1Ax1  +  y2Ax2  +  ...]>  0,     h  ^  r 

where  the  summation  is  over  a  circular  chain  consisting  of  k 
distinct  points  and  returning  to  the  original  point  and  where 
the  superscripts  in  y)  and  in  x)  —  xf'1  —  Axi  have  been 
omitted.  Similar,  but  non-independent,  dual  relations  involving 
Ay  and  x  can  be  written. 

13.  Relations  of  the  type  given  in  Theorem  5  are  necessary 
conditions:  violation  of  any  one  of  them  could  serve  as  a  refuta- 
tion of  the  hypothesis  that  we  are  dealing  with  a  regular  minimum 
system.  (If  we  drop  regularity  assumptions,  equalities  must  be 
added  to  the  inequalities  but  otherwise  the  theorem  is  still 
valid.)  The  question  naturally  arises  as  to  whether  there  can  be 
any  further  necessary  conditions  beyond  the  ones  already  de- 
rived. That  is,  are  these  necessary  conditions  in  some  sense  also 
sufficient  conditions  for  a  regular  minimum  system?  The  following 
answer  can  be  given. 

Theorem    7:    Given    n    smoothly    differentiable    relations 
holding  between  2n  conjugate  variables,  namely  yl = Y  i(xlt  ...,xn), 
i  =  1,  2,  .  .  .,  n,  and  if  there  can  never  arise  for  any  finite 
number  of  observations  a  violation  of  any  of  the  inequalities 
of  Theorem  6,  we  are  then  assured  that  the  system  is  a  regular 
minimum   one,   with   symmetric   positive   definite   Jacobian 
matrix  and  all  the  other  properties  of  such  a  system. 
To  prove  this,  consider  a  chain  of  m  distinct  points  that  lie 
along  any  simple  closed  path  or  contour  in  the  n  dimensional 
(xv  .  .  .,  xn)   space,   with  1,  2,  .  .  .,  m  being  arrayed  by  conven- 
tion in  counterclockwise  order.  Define  Sm  =  ^[^ly^itf—tf*1)] 
and  sm  =  2rEiV K:  ~  *-+1)]-    %    (27)    and    (28)    and    our 
present  hypothesis,  it  is  to  be  true  that  Sm  is  positive  and  sm 
is  negative  for  any  m.  Now  let  the  number  of  observations,  my 
along  the  same  simple  closed  contour  increase  indefinitely  in 
such  a  way  as  to  make  the  distance  between  any  two  adjacent 
points  go  to  zero  in  the  limit.  Then  clearly  from  the  definition 
of  an  integral  as  the  limit  of  a  sum, 


lim  Sm  =  ^JyjdXj  =  lim  s 


m— >oo 
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where  the  indicated  line  integral  is  taken  counterclockwise 
around  the  specified  simple  closed  contour  and  where  y5  =  Y* 
(xlt  .  .  .,  xn).  Since  this  line  integral  is  the  limit  of  a  sequence  of 
positive  numbers  and  at  the  same  time  is  the  limit  of  a  sequence 
of  negative  numbers,  its  value  is  proved  to  be  zero  along 
any  closed  contour.  Hence  the  expression  ^jjicixj  must  be  an 
exact  differential  dY(xv  .  .  .,  xn),  with  yi  =  Yi(xiI  .  .  .,  xn)  = 
Yt{xv  .  .  .,  xn)  and  Y]  =  Y{.  This  proves  the  symmetry  or 
variational  aspect  of  the  problem. 

To  prove  the  definiteness,  we  may  consider  the  inequality 
that  must  hold  between  any  pair  of  points;  given  our  assumptions 
concerning  differentiability,  this  may  be  written  in  the  Taylor's 
expansion 

1  11 

where  not  all  Axi  are  zero  and  where  the  second  partial  derivatives 
are  evaluated  at  some  point  intermediate  between  the  two 
points  in  question.  Clearly  at  such  a  point  the  quadratic  form 
made  up  of  the  Hessian  matrix  must  be  positive  definite;  and 
the  same  must  be  true  of  every  observed  point,  else  we  could 
write  down  a  contradiction  to  our  finite  inequality.  This  com- 
pletes the  proof. 

The  interpretation  of  this  theorem  is  rather  delicate.  It  shows 
that  we  can,  by  taking  enough  observations  close  enough 
together,  always  succeed  in  detecting  any  deviation  of  the  facts 
from  the  regular  minimum  hypothesis.  But  it  does  not  tell  us 
how  many  observations  will  be  necessary.6)  Therefore,  no  matter 
how  many  observations  we  may  have  examined,  all  of  which 
satisfy  all  requisite  inequalities,  we  cannot  be  sure  but  that 
still  further  observations  might  show  a  violation  of  the  hypo- 
thesis. This  is  a  common  situation:  methodologically,  we  can 

6)  In  Economica,  February,  1953,  p.  9  I  posed  the  "open  question"  as  to 
how  many  situations  m  might  be  needed  to  test  the  existence  of  an  integrable 
field.  Were  n  —  2,  vve  know  that  m  =  2  would  be  sufficient.  For  n  ^  3  is  there 
a  similar  finite  m  =  &(n)  bound  on  the  number  of  situations  that  might  be 
needed  to  refute  integrability?  The  question  can  now  be  closed  by  the  statement: 
when  there  is  definitely  non-integrability,  the  closer  the  conditions  for  inte- 
grability are  to  being  fulfilled,  the  larger  must  be  the  m  needed  to  reveal  that 
non-integrability;  so  there  is  no  a  priori  finite  bound  on  m  possible. 
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refute  a  hypothesis  with  a  finite  number  of  observations,  but  we 
can  never  strictly  "confirm"  it  in  this  way.  Furthermore,  the 
theorem  does  not  tell  us  that,  for  fixed  m,  satisfying  all  the 
prescribed  inequalities  is  all  that  can  be  required  of  the  ob- 
servations: i.e.,  it  does  not  prove  the  non-existence  of  any  further 
independent  implications  of  the  regular  minimum  hypothesis. 
I  would  conjecture,  from  various  algebraic  and  geometric 
considerations,  that  no  further  independent  inequalities  can  be 
prescribed;  but  this  has  not  yet  been  proved. 

14.  This  completes  the  program  of  characterization  of  the 
necessary  and  sufficient  conditions  for  a  system  to  have  the 
structure  of  a  regular  minimum.7)  A  number  of  further  problems 
may  be  briefly  mentioned.  First,  the  smoothness  and  uni- 
queness assumptions  involved  in  the  notion  of  "regularity" 
may  be  dropped,  and  investigations  may  be  made  of  the  resulting 
modifications  in  the  theorems.  With  partial  derivatives  not 
being  defined  at  sets  of  points,  the  Jacobian  and  Hessians  of 
the  first  part  of  this  paper  will  then  not  necessarily  exist  or  be 

7)  The  rather  ambiguous  principle  of  Le  Chatelier,  in  its  correct  form 
*i(yi.  •  •  •>  Vn)  ^  xl(yv  •  •  •>  Vn-x,  a>«)  ^  .  .  .  ^  x\{yx;  x2,  .  .  .,  xn)  ^  0 

can  be  derived  from  our  conditions,  as  can  the  equivalent  dual  form  which 
says  that  y^{yx,  •  •  •>  Vm>  •  •  ■>  xn)  must  be  positive  and  non-decreasing  with  m. 
But  adding  the  same  finite  positive  Ayx  to  each  (yx,  .  .  .,  ym;  xm+1,  .  .  .,  xn), 
all  of  which  correspond  to  the  same  initial  equilibrium  point,  need  not  lead 
to  the  finite  inequalities 

^toi+dyv  •  •  •>  v%)  ^vx{yi+Ayi,  • .  ■,  y«_x; »«)  ^  . .  •  ^  ^(y1+Ay1;  xt, . . .,  xn). 

For  sufficiently  small  changes  in  yx  these  finite  inequalities  must  hold,  but  the 
proof  is  not  immediate.  For  finite  moves  of  any  size,  it  is  easy  to  give  counter- 
examples that  are  nonetheless  perfectly  regular.  This  may  be  regarded  as 
slightly  paradoxical  since  it  says  that  from  any  point  in  the  (pressure,  volume) 
diagram,  there  will  be  an  isothermal  line  "flatter"  than  an  adiabatic  line; 
yet,  the  finite  change  in  pressure  in  a  system  when  you  change  its  volume 
holding  temperature  constant,  can  turn  out  to  be  greater  than  the  change  in 
pressure  resulting  from  an  equal  change  in  volume  with  entropy  being  held 
constant.  The  paradox  is  disspelled  if  one  realizes  that 

rAvx 
J0      0^(2/1  +  t,  .  .  .,  ym;  .  .  .,  xn)  -  x\{yx  +  /,...,  ym+1;  .  .  .,  xn)}dt 

has  an  integral  that  starts  out  non-negative  at  t  =  0  but  which  may  become 
negative  as  the  equilibrium  points  being  compared  become  different  ones. 
The  last  vestige  of  paradox  disappears  when  we  realize  that  regular  systems 
need  not  admit  of  (p,  v)  as  prescribable  coordinates;  hence,  any  point  in  this 
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in  vert  able.  Extensions  of  the  Implicit  Function  Theorem  and 
of  the  concept  of  one-to-one  correspondence  will  lead  to  results, 
of  which  the  regular  systems  constitute  one  special  case.  As  far 
as  the  finite  inequalities  of  the  latter  part  of  the  paper  are 
concerned,  with  the  addition  of  equalities  to  the  inequalities, 
they  will  all  remain  valid. 

A  second,  quite  different  extension  of  the  present  program  is 
to  apply  it  to  the  study  of  regular  systems  involving  an  in- 
finite number  of  variables  or  the  equivalent.  Thus,  the  differential 
equations  of  classical  mechanics  can  be  regarded  as  functional 
systems  that  are  limiting  cases  involving  an  infinite  number  of 
variables.  The  question  naturally  arises  as  to  an  exhaustive 
summary  of  the  necessary  and  sufficient  conditions  that  the 
observable  motions  must  satisfy  if  they  are  to  have  arisen 
from  a  regular  variational  system  that  is  or  is  not  a  true  minimum 
system.  As  far  as  I  know,  no  one  has  yet  carried  through  such 
a  program  in  the  full  detail  of  the  present  examination  of  the 
simpler  case  of  a  finite  number  of  functions  of  a  finite  number 
of  variables.  The  present  discussion  shows  some  of  the  problems 
that  will  certainly  be  encountered  in  the  more  difficult  realm  of 
function  spaces. 


diagram  may  correspond  to  more   than   one   physical   state   of  the   system. 
A  slight  generalization  of  the  Le  Chatelier  principle  can  be  stated  in  terms 
of  quadratic  forms,  namely 

r  r  r  r 

SSa3<(^,  .  .  .,  ym;  .  .  .,  <BB)My  ^  SS  x){yx,  .  .  .,  yk;  .  .  .  aj/^  A, 
11  11 

where  r  ^  k  <  m.  This  is  provable  directly  from  the  form  of  /„  in  (6)  and  its 
dual. 


APPENDIX 


Axiomatic  Basis  For  Equilibrium  Relations  in  Classical 
Thermodynamics. 


1.  The  formal  mathematical  analogy  between  classical  thermo- 
dynamics and  mathematic  economic  systems  has  now  been 
explored.  This  does  not  warrant  the  commonly  met  attempt 
to  find  more  exact  analogies  of  physical  magnitudes  —  such 
as  entropy  or  energy  —  in  the  economic  realm.  Why  should 
there  be  laws  like  the  first  or  second  laws  of  thermodynamics 
holding  in  the  economic  realm?  Why  should  "utility"  be  literally 
identified  with  entropy,  energy,  or  anything  else?  Why  should 
a  failure  to  make  such  a  successful  identification  lead  anyone 
to  overlook  or  deny  the  mathematical  isomorphism  that  does 
exist  between  minimum  systems  that  arise  in  different  dis- 
ciplines? 

In  this  Appendix,  some  of  the  special  differences  between  the 
variables  and  relations  that  arise  in  thermodynamics  and  in 
economics  will  be  briefly  explored.  This  will  involve  giving  a 
slightly  unconventional  axiomatic  base  for  classical  thermo- 
dynamics, or  at  least  for  that  part  which  deals  exclusively  with 
equilibrium  states  and  which  might  be  called  thermostatics.1) 

2.  While  in  economics  the  variables  (xit  y^  will  usually  be 
observable  quantities  of  goods  and  their  respective  prices,  in 
thermodynamics  they  will  not  all  be  as  immediately  observable 
as  is  the  case  of  volume  and  (negative)  pressure.  Thus,  (xv  yx) 
might  correspond  to  "entropy"  and  "absolute  temperature," 
which  to  early  writers  were  not  known,  directly  observable 
quantities.  And  a  function  like  Y(xv  .  .  .,  xn),  which  in  economics 
might  be  easily  measured  as  dollar  revenue,  in  thermodynamics 

x)  Such  an  axiomatic  excursion  departs  from  the  usual  Clausius  formulation, 
in  terms  of  the  first  and  second  laws.  (Cf.  also  the  work  of  Carnot,  Kelvin, 
Gibbs,  and  Planck).  It  resembles  a  little  the  alternative  axiomatic  approach 
of  C.  Caratheodory,  Mathematische  Annalen,  67  (1909)  pp.  355 — 86.  See  M. 
Born,  Natural  Philosophy  of  Cause  and  Chance  (Oxford,  1949),  Ch.  V  for  a  good 
account  and  for  references. 
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might  correspond  to  an  internal  energy  function  whose  existence 
and  properties  have  to  be  established  by  intricate  reasoning. 

Fig.  1  plots  the  properties  of  a  simple  system  whose  states 
can,  for  simplicity,  be  assumed  to  be  determined  by  pressure 
and  volume  (ft,  v).  The  solid  contours  represent  loci  of  equal 
"temperature"  and  as  yet  have  no  natural,  preferred  numbering 
on  them  corresponding  to  absolute  temperature,  t.  Instead 
T  =  T(v,  ft)  represents  any  monotone  numbering;  and  any 
renumbering  T'  =  f(T)  =  T'(v,  fi),  with  /  an  arbitrary  mono- 
tone-increasing function,  would  be  an  equally  good  indicator 
of  greater  and  smaller  temperatures. 

The  broken  contours  represent  "adiabatics"  and  represent 
loci  of  points  that  could  be  observed  if  the  system  were  "in- 
sulated" from  its  environment  by  perfect  non-conducting  walls. 
I  omit  a  good  deal  of  explanatory  matter  needed  to  establish 
the  exact  nature  of  these  two  sets  of  contours,  instead  taking 
them  as  the  given  primitive  concepts  of  the  axiomatic  system. 
Note  that  it  would  be  premature  to  call  the  adiabatics  "isen- 
tropes"  since  we  do  not  yet  have  any  entropy  magnitude  to  be 
held  constant.  Yet  we  can  give  an  arbitrary  numbering  to  the 
adiabatics,  written  as  5  =  S(v,  ft)  or  as  S'  =  F(S)  =  S'(v,  ft), 
where  again  F  is  an  arbitrary  monotone-increasing  function 
which  will  preserve  the  direction  of  the  ordering. 

3.  The  first  task  of  our  thermodynamic  system  is  to  define 
its  own  "canonical  entropy  and  temperature' '  so  that  we  can 
go  from  the  arbitrary  scaling  of  T  and  5  to  a  privileged  canonical 
temperature  t  and  canonical  entropy  s;  when  this  is  done  we  shall 
be  able  to  define  an  internal  energy  function  e  =  e(v,  ft)  or 
e  —  E(s,  v). 

Note  that  Fig.  1  is  drawn  with  the  special  property  of  "pro- 
portional areas."  Thus  the  shaded  area  A  is  to  B  as  the  area 
A'  is  to  B'.  And  the  same  equal  proportionality  would  be  true 
of  the  appropriate  curved  parallelograms  formed  by  any  four 
thermal  contours  intersecting  with  any  four  adiabatics. 

This  equal  ftroftortionality  of  areas  is  not  an  accident:  it  is  a 
fundamental  regularity  of  nature,  from  which  we  can  deduce  the 
existence  of  an  internal  energy,  a  canonical  temfterature  and 
canonical  entrofty. 
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4.  In  fact  pick  any  thermal  contour  aa  as  the  arbitrary  origin 
for  temperature  and  any  other  thermal  contour  bb  as  the  arbi- 
trary t  ==  1  level  which  will  set  the  scale  of  one  unit  of  canonical 


Fig.  l 


temperature.  We  can  now  give  a  determinate  number  t  to  any 
arbitrarily  given  thermal  contour  by  making  its  t  the  ratio  of 
the  algebraic  area  that  it  and  any  pair  of  adiabatics  form  with 
aa  relative  to  the  area  formed  by  the  unit  temperature  contour. 
Fig.  2.  shows  the  resulting  canonical  temperature  scaling  for 
each  contour.  (Verify  that  cc  corresponds  to  t  =  1.5  because 
the  area  C  -f-  B  is  one  and  a  half  times  as  great  as  the  area  C 
alone;  of  course  (C  +  B')jB'  gives  the  same  ratio.  And  dd 
corresponds  to  t  =  —2.0  because  the  area  A  below  aa  is  twice 
the  absolute  size  of  the  area  C.) 

5.  Having  defined  canonical  t,  we  have  no  further  use  for 
T  or  T'.  And  we  can  now  go  from  S  or  S'  to  canonical  s,  called 
entropy.  This  may  be  measured  from  any  point  selected  as  an 
arbitrary  origin:  thus  the  point  northwest  of  A  may  be  taken 
as  zero  origin,  although  it  is  to  be  understood  that  this  entropy 
origin  would  not  have  to  be  at  a  point  where  t  =  0. 

Once  the  dimension  of  pressure  is  selected,  the  scale  of  5  is 
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not  arbitrary.  Suppose  f$p  has  been  selected  just  the  proper 
distance  from  aa  as  to  make  the  area  C  (which  has  the  dimension 
of  pressure  times  volume)  be  exactly  unity  in  area.  Then  fifi  is 
given  the  designation  s  =  1.0  and  every  other  adiabatic  is 
given  a  unique  s  determined  by  the  algebraic  value  of  the 
area  which  it  makes  with  aa,  bb,  and  aa.  (If  the  adiabatic  is 
to  the  left  of  aa  we  give  it  a  negative  entropy  with  magnitude 
proportional  to  appropriate  absolute  area.  See  Fig.  2  for  il- 
lustrative s  values.) 


1.0  v 

Fig.  2 

6.  Now  that  we  have  defined  the  functions  t  =  t(v,  ft)  and 
s  =  s(v,  ft),  we  can  verify  that  the  equal  proportionality  of  area 
property  implies  that  the  line  integral 
(Al)  t(v,  ft)  ds{v,  ft)  —  ftdv 

must  be  independent  of  path  and  expressible  as  the  exact  dif- 
ferential de(v,  ft)  of  an  existent  "canonical  internal  energy" 
function  e  =  e(v,  ft),  which  is  arbitrary  in  its  origin  because  of 
the  constant  of  integration. 

Because  the  canonical  temperature  t  is  arbitrary  in  its  origin, 
transforming  from  t  to  t  +  t0,  would  introduce  an  arbitrary  linear 
entropy  term  t0(s  —  s0)  in  our  expression  for  canonical  energy. 
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Within  the  realm  of  thermostatics,  where  we  merely  observe 
equilibrium  states  and  make  no  measurements  of  "irreversible 
processes/'  we  can  never  hope  to  remove  this  ambiguity  in  the 
definition  of  canonical  internal  energy.2)  We  can  never  hope 
to  deduce  the  level  of  absolute  zero,  which  characterizes  the 
Kelvin  scale.  We  cannot  define  the  "efficiency"  of  an  hypo- 
thetical Carnot  cycle  from  knowledge  of  the  isotherms  and  adia- 
batics  alone.  (In  fact  the  present  treatment  never  explicitly 
mentions  "heat,"  inexact  differentials  like  dQ,  or  exact  differen- 
tials like  dQ/t.) 

7.  Under  our  simplifying  assumption  that  (v,  p)  are  admissible 
state  variables,  we  can  rewrite  internal  energy  e  as  a  function 
of  the  pair  of  variables  (s,  v)  to  get  E(s,  v)  such  that 

(A2)  dE(s,  v)  =  t(s,  v)ds  —  p(s,  v)dv. 

Here  E(s,  v)  corresponds  precisely  to  my  earlier  Y(xv  x2,  .  .  .); 
and  it  would  be  easy  to  show  that  the  Gibbs  potential 
e  —  ts  —  {—p)v,  written  as  a  function  of  t  and  —  p,  corresponds 
to  the  dual  X(ylt  y2,  .  .  .).  Corresponding  to  P(y1')  x2,  .  .  .)  would 
be  the  Helmholtz  free  energy  e  —  ts  written  as  a  function  of 
t  and  v. 

8.  All  of  the  above  implications  of  the  equal-proportional 
areas  axiom  can  be  given  brief  mathematical  summary3)  in  the 
case  where  all  the  functions  have  nice  (overly  strong)  differen- 
tiability and  regularity  properties:  in  particular  where  any  two 
T(s,  p)  and  S(v,  p)  contours  intersect  in  one  and  only  one  point 
so  that  these  functions  can  be  inverted  to  give  v  =  v(S,  T) 
and  p  =  p(S,  T)  with  smooth  partial  derivatives. 

Suppose  we  know  that  indicators  of  temperature  and  entropy 
can  be  converted  by  determinate  (save  for  arbitrary  origin 
constants  and  scale  constants)  functions  t  =  f(T)  and  s  =  F(S) 
to  yield  canonical  temperature  and  energy;  and  that  there  exists 
a  canonical  internal  energy  function  e  =  E(s,  v)  with  the 
Maxwell  reciprocity  or  integrability  property 

2)  This  was  noted  by  Caratheodory,  op.  cit.,  p.  381.  See  also  A.  Lande, 
Handbuch  der  Physik,  Band  IX,  281  —  300  for  a  treatment  related  to  the 
present  one. 

3)  This  and  the  next  section  can  easily  be  skipped. 
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d2E(s,  v)       d2E(s,  V) 


(A3) 


dvds  dsdv 


From  the  form  of  the  exact  differential  in  (A2),  this  is  equivalent 
to 

dt(s,  v)_       dp{x,  v)  . 
(A4)  ~d^  =  fc~' 

which  in  terms  of  arbitrary  T  and  5  and  f(T)  and  F(S)  becomes 


m 

\ds/, 


(A5)  ^f{T)F'(S). 

\dvj  s 

Now  the  left-hand  side  of  this  relation  is  a  perfectly  observable 
magnitude,  being  a  function  of  (v,  ft)  or  (5,  T)  or  any  other 
two  variables.  Let  us  call  it  /.  The  equal-proportional  area 
property  —  which  is  an  integrability  property  in  the  large  — 
is  equivalent  to  saying  that  this  observable  left-hand  expression 
/  is  truly  a  product  of  two  separate  functions.  I.e.,  necessarily 
log  J  must  be  the  sum  of  a  function  of  T  and  a  function  of  S, 
so  that 

(A6)  a21°!/f  r^0 

is  a  necessary  and  sufficient  condition  for  the  equal-proportional 
area  property  to  hold. 

An  alternative  functional  equation  which  J  must   satisfy  is 


(A7) 


J(S»  Tx)     J(S2,  t\; 
J(SV  T2)      J(S2>  T2] 


=  0 


for  all  (Sv  7\)  and  (S2,  T2). 

Both  (A6)  and  (A7)  are  observable  conditions  which  could 
be  empirically  verified. 

Incidentally,  by  some  manipulations  of  implicit  function 
theory,  it  is  easy  to  show  that  /  is  a  certain  Jacobian  determinant, 
namely 
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(A8) 


\dsJ* 

\dvl. 


1 

dv  dv 
dSdf 
dp  dp 
dSdf 

dsds 

dv  dp 
dTdT 
dv  dp 

d(v,  p) 
d(S,  T) 


=  J(S,T). 


9.  Given  the  observable  function  J(S,   T),  we  can  derive 
canonical  temperature  and  entropy  by  simple  integrations.  Thus 


t  =  f(T) 


(A9) 


=  h  +  f 


«o  +  f   f'{r)dr 


'.J(S0,  T( 


dr, 


where  t0  =  f{T0)  is  an  arbitrary  origin  constant  and  f'(T0) 
an  arbitrary  scale  constant.  Also 


is 


(A10) 


=  F(S)  =  s0+  [S  F'(o)dc 
JsQ 

•'-So    /  (T0) 


Because  /  has  the  remarkable  multiplicative  property,  this 
f(T)  integral  will  be  independent  of  the  50  level  and  (AlO)'s 
F  (S)  integral  will  necessarily  be  independent  of  the  T0  level.  Note 
that  the  origin  of  s  is  arbitrary,  but  the  scale  is  definitely  not. 

Since  the  line  integral  tds  —  pdv  is  independent  of  path,  we 
can  by  a  variety  of  alternative  integrations  calculate  the  canonical 
internal  energy  function  E(s,  v),  which  is  determinate  except  for 
an  arbitrary  linear  term  £0(s  —  s0). 

10.  So  far  the  discussion  dealt  with  simple  systems  with  only 
two  degrees  of  freedom:  e.g.  a  single  homogeneous  fluid.  The 
existence  of  an  absolute  or  Kelvin  temperature  which  is  the  same 
for  all  substances  shows  that  the  equal-proportional  area  law 
must  apply  also  to  the  interrelations  of  different  bodies.  Thus 
Fig.  3  shows  that  if  two  bodies  are  in  respective  equilibrium 
at  t  =  0  and  at  t  =  1,  then  we  can  separately  for  each  calculate 
the  canonical  temperatures  t±  =  \  and  t2  =  \  by  our  previously 
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described  techniques  of  canonical  temperatures.  Then  we  can 
confidently  predict  —  in  advance  of  ever  having  made  the 
experiment !  —  that  the  tx  =  \  and  t2  =  \  contours  will  truly 
represent  the  same  temperature  and  represent  mutual  equi- 
librium. 


P. 


Fig.  3 

This  requires  that  the  area  relations  A1/B1  =  A2jB2  hold 
between  different  substances.  (In  fact  this  is  the  generalized  form 
of  our  equal-proportional  area  axiom,  from  which  all  the  other 
relations  of  thermostatics  follow.) 

11.  There  are  far-reaching  implications  of  the  axiom.  Thus, 
suppose  two  (or  more)  bodies  are  separated  by  diathermous 
walls  (i.e.  walls  which  require  the  bodies'  mutual  equilibrium 
to  be  at  equal  temperatures)  and  that  these  walls  are  also  elastic 
(i.e.  such  as  to  require  equal  pressures  at  equilibrium).  Then 

h{H>  ^i)  =  h(H>  vz)  =  ...  =  t  =  f(T) 
fii^v  Vi)  =  P*(s*  v2)  =  .  .  .  =  p. 
But  in  terms  of  the  internal  energies  £*($*,  wf),  this  means 


(Air 


(A12) 


ds± 
d£i(*i>  Pi) 


dE2(s2,  v2) 

ds2 
dE2(s2,  v2) 

dv9 


-fi> 
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which  in  turn  is  seen  to  be  the  necessary  condition  that 
g.===  ex  +  02  +  •  •  •  ==  £i(si>  vi)  +  ^2(52»  %)■+•••  be  at  a  min- 
imum subject  to 

(A13)  sx  +  s2  +  .  .  .  =  s 

v  »i "+  «2  +  ...=»  v; 

and  that  the  resulting  minimized  energy  be  a  function  of  the 
total  s  and  v  respectively  —  namely 

minimum  e  =  E(s1  +  s2  +  .  .  .,  vx  +  v2  +  .  .  .)  =  2s(s,  v)  with 

as  ov 

where  again  an  arbitrary  linear  term  in  total  entropy  t0(s  —  s0) 
will  be  involved.4) 

For  a  true  maximum,  it  is  sufficient  that  each  E^v^  sj  have 
a  positive  definite  Hessian  matrix  of  second  partial  derivatives 


H< 


Et   d*E{ 


d2E„.   d2E„. 


dSi  dVi    dv\ 

and  then  it  will  follow  that  the  Hessian  matrix  of  E(s,  v)  is 
positive  definite.40) 

4)  If  the  bodies  are  in  thermal  contact  but  separated  by  rigid  walls,  the 
pressure  equality  of  (A12)  is  lost  and  the  volumes  (vx,  v2,  .  .  .)  are  separately 
specifiable:  we  then  get  e  =  ex  +  e%  +  •  •  •  =  E(si  +  ^2  +  •  •  •»  vi>  vz  •  •  •)  m 
minimized  form,  with  BEjdVi  =  —pi  and  dEjds  =  t. 

ia)  Many  treatises  on  thermodynamics  seem  to  be  saying  that  E(s,  v)  is  at 
a  minimum  for  prescribed  s  and  v.  Actually  E(s,  v)  is  a  determinate  function 
of  its  arguments  and  is  not  free  to  be  at  a  maximum  or  a  minimum.  It  is  the 
sum  £1(s1,  vx)  +  E2(sz,  v2)  which  has  its  (st-,  v{)  varied  subject  to  fixed  sx  +  s2 
and  vx  +  v2  until  Ex  +  E2  is  at  a  minimum.  When  the  resulting  optimal  (Sj-,  v{) 
are  substituted  into  E1(s1,  vx)  +  E2(s2,  v2)  the  result  defines  E(s,  v). 

That  the  Gibbs  free  energy  or  potential  is  at  a  minimum  for  fixed  t  and  p 
likewise  requires  careful  statement.  Write  the  expression  e  —  ts  —  {  —  p)v, 
regarded  as  a  function  of  t  and  p  alone,  as  G(t,  p).  Then  G(t,  p)  cannot  be  at  a 
minimum  for  arbitrarily  prescribed  (t,  p),  nor  at  a  maximum,  nor  anything 
else  but  at  its  determinate  value.  Rather  it  is  G (s,  v;  t,  p)  =  E(s,  v)  —  ts  —{—p)v 
that  is  at  a  minimum  with  respect  to  (5,  v)  for  prescribed  (t,  p) .  This  can  be  shown 
to  be  a  special  case  of  the  condition  that  Ex  +  E2  is  at  a  minimum  for  prescribed 
vx  +  v2  and  sx  +  s2.  Think  of  5,  v  and  E  as  belonging  to  one  body,  and  hence 
write  them  as  sx,  vx,  Ex(sx,  vx).  Now  consider  a  second  body  so  large  compared 
to  the  first  that  its  temperature  and  pressure  will  be  affected  negligibly  by 
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12.  Note  that  in  contrast  to  the  Caratheodory-Born  treat- 
ment, there  was  here  no  stipulation  of  any  integrability  con- 
ditions or  inaccessibility  conditions  other  than  that  of  equal- 
proportional  area.  Once  the  Ti  =  T^v^  pt)  and  St  =  Si(vit  ft *) 
contours  were  specified  and  the  equal-proportional  area  axiom 
of  a  single  uniform  canonical  temperature  was  specified,  then 
all  the  remaining  relations  of  thermostatics  logically  followed. 

13.  Instead  of  minimizing  total  energy  e  for  given  total 
entropy  s  and  total  volume,  with  t  >  0  we  could  have  maximized 
entropy  for  given  total  energy  and  volume.  Leaving  thermostatics 
for  true  thermodynamics  and  actually  observing  the  time 
changes  of  a  given  isolated  system,  we  would  actually  find  that 
total  energy  would  remain  constant  and  total  entropy  would 
increase  in  time.  But  within  the  realm  of  thermostatics  —  i.e. 
the  study  of  equilibrium  relations  like  (All)  —  we  would  have 
no  knowledge  of  this  or  interest  in  it. 

14.  The  present  formulation  5)  thus,  seems  to  have  turned 
the  usual  formulation  upside  down.  Rather  than  begin  with 
the  first  and  second  laws  of  thermodynamics  and  then  deduce 
equilibrium  relations  and  the  existence  of  a  universal  absolute 
temperature,  we  have  assumed  a  universal  canonical  temperature 
(with  arbitrary  origin)  and  have  then  deduced  equilibrium 
relations  and  integrability  conditions.  We  have  assumed  less 
than  the  first  and  second  laws:  and  we  deduce  less.  We  deduce 
only  thermostatic  relations,  deducing  almost  nothing  about 
irreversible  processes.  (In  the  original  definition  of  the  adia- 
batics  Si  =  Sityi,  ftt)   there  does  lurk  in  the  background  an 

the  first.   (A  "heat  and  pressure  bath.")  Write  its  energy  as  E2(s2,  v2)  = 
ts2  —  {—p)v2,  where  t  and  p  are  fixed  parameters.  Now  minimize  Ex(sx,  wx)  + 
E2(s2,  v2)  subject  to  fixed  s  =  sx  +  s2  and  v  =  vx  +  v2  to  get 
Minimum  Ex(sx,  vx)  +  E2(s2,  v2)  =  [Ex{slt  vx)  —  isx  —  (—  p)vx]  +  ?s  +  (— p)v 

=  Gx(sx,  vx\  i,  p)  -f  a  constant. 

This  last  expression  shows  that  minimizing  the  Gibbs  Gx  is  a  special  case  of 
minimizing  Ex  +  E2,  the  total  of  the  observed  system  and  its  "environment." 
5)  The  equal-proportional  area  property  on  which  all  this  is  based  was 
suggested  to  me  by  an  obvious  extension  of  the  reasoning  of  Maxwell's  Theory 
of  Heat  (1871  edition),  but  I  have  not  seen  it  explicitly  alluded  to  in  any 
twentieth  century  book.  It  is,  however,  so  obvious  and  basic  a  property  that  it 
must  have  been  rediscovered  many  times. 
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implicit  reference  to  irreversible  processes  and  two-dimensional 
"inaccessible  neighboring  states."  But  once  the  primitive  notion 
of  an  adiabatic  is  assumed,  all  the  rest  is  thermostatics.) 

The  Gibbs  thermodynamics  of  heterogeneous  substances  has 
not  been  here  discussed.  But  what  was  done  here  for  —jpdv 
areas  could  also  be  done  for  —J  iiidMi  areas,  where  fi{  represents 
chemical  potentials  and  Mi  chemical  masses;  and  it  would  seem 
that  no  new  mathematical  problems  would  be  raised. 
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Decentralization  and  Computation 
in  Resource  Allocation 

I.  FORMULATION  OF  THE  PROBLEM 

A.  Introduction. 

In  this  paper,  we  wish  to  discuss  the  bearing  of  some  recent 
developments  in  mathematical  economics  on  the  problem  of  the 
optimal  allocation  of  resources.  We  will  confine  attention  here 
to  an  economy  whose  aims  are  well  defined.  That  is,  we  assume 
that  the  preferences  of  the  economic  system  can  be  embodied 
in  a  utility  function  which  depends  upon  the  outputs  of  com- 
modities. For  a  given  technology,  the  possibilities  of  different 
output  combinations  are  restricted  by  the  availabilities  of 
primary  resources.  The  problem  of  optimal  resource  allocation 
is  to  choose  among  all  the  feasible  combinations  of  production 
processes  that  combination  which  maximizes  the  utility  achieved 
by  the  economy. 

Since  the  discussion  is  at  a  fairly  high  level  of  abstraction,  the 
economy  being  studied  may  be  a  nation  or  some  smaller  economic 
system,  including  a  single  firm.  The  assumption  that  a  single 
utility  function  represents  the  objectives  of  the  economy  fits 
best  the  case  of  a  firm.  For  a  nation,  the  assumption  is  less 
justified,  but  it  provides  an  introduction,  at  least,  to  the  more 
complex  problem  raised  by  the  presence  of  many  individuals, 
each  of  whom  judges  the  workings  of  the  economic  system  in 
light  of  his  own  utility  function.  We  also  avoid  the  subtle 
problems  involved  in  defining  optimality  in  the  more  general  case. 

The  problem  of  choosing  the  allocation  of  primary  resources 

*  The  research  reported  on  in  this  paper  was  to  a  large  extent  carried  on 
under  the  auspices  of  the  RAND  Corporation.  We  also  wish  to  thank  the 
Office  of  Naval  Research  and  the  Center  for  Advanced  Study  in  the  Behavioral 
Sciences  for  their  assistance  and  M.  J.  Beckmann  and  W.  M.  Gormon  for  their 
comments. 
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among  different  productive  processes  so  as  to  maximize  a 
prescribed  utility  function  is  a  mathematical  one,  and  its 
solution  in  any  concrete  case  can  be  regarded  as  a  matter  of 
computation.  Now  any  computation,  beyond  the  very  simplest, 
involves  a  process  of  successive  approximations,  that  is,  a  process 
carried  out  in  several  steps,  where  the  calculations  in  each  step 
make  use  of  the  results  of  the  preceding  steps  and  one  of  the 
results  of  each  step  is  an  approximation  to  the  desired  answer. 
(The  ordinary  process  of  long  division  is  an  example  of  a  process 
of  successive  approximations;  the  results  of  each  trial  division 
step  are  another  digit  to  be  added  to  the  quotient  and  a  new 
dividend  to  be  used  in  the  next  trial  division.) 

That  the  market  place  solves  the  economic  problem  of  equating 
supply  and  demand  by  successive  approximations  to  the  equi- 
librating price  or  prices  is  a  concept  familiar  from  elementary 
economics  textbooks.  In  a  single  market,  each  approximation 
results  in  naming  a  price  and  calculating  the  difference  between 
demand  and  supply  at  that  price;  the  next  approximation  in- 
volves adjusting  the  previous  trial  price  in  a  manner  governed 
by  this  difference,  with  the  idea  of  ultimately  wiping  it  out. 
The  notion  of  this  kind  of  dynamics  in  the  movement  of  prices 
has  its  roots  in  the  English  classical  economists,  perhaps  par- 
ticularly John  Stuart  Mill,  but  it  received  its  first  explicit 
recognition,  particularly  with  reference  to  simultaneous  succes- 
sive approximations  to  all  prices,  in  the  concept  of  tatonnements 
of  Leon  Walras.1) 

Since  welfare  economics  assures  us  that  under  certain  as- 
sumptions as  to  the  utility  function  and  the  productive  process 
(see  part  III,  section  A  below  for  a  more  explicit  discussion  of 
these  assumptions)  a  competitive  equilibrium  can  be  identified 
with  an  economic  optimum,  we  may  conclude  that  the  method 
of  successive  approximations  which  solves  the  problem  of  market 

x)  See  L.  Walras,  Elements  of  Pure  Economics,  ed.  and  tr.  by  W.  Jaffe, 
(London:  George  Allen  and  Unwin,  1954),  pp.  84—86,  90—91,  105—6,  169—172, 
243 — 54,  and  184 — 95.  For  a  vigorous  and  stimulating  analysis  of  Walras's 
theory  of  tatonnements  and  its  central  function  in  seeking  to  show  that  economic 
equilibrium  is  solved  in  the  market  by  successive  approximations,  see  D.  Patinkin, 
Money,  Interest,  and  Prices  (Evanston,  Illinois,  and  White  Plains,  New  York: 
Row,  Peterson  and  Company,   1956),  pp.  377 — 85. 
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equilibrium  is  also  a  computational  method  for  solving  the 
problem  of  optimal  resource  allocation.  Indeed,  it  was  seen  in 
precisely  this  light  by  Pareto2),  who  compared  the  market 
to  a  computing  machine.  Of  course,  as  soon  as  the  problem  of 
optimal  resource  allocation  is  formulated  as  the  solution  of  a 
system  of  simultaneous  equations  (namely,  the  equations 
defining  competitive  equihbrium),  the  possibility  arises  of  solving 
them  by  some  centralized  procedure  involving  the  use  of  com- 
puting machines  rather  than  the  market.  Pareto  objected  that 
the  enormous  number  of  equations  made  such  a  procedure 
impossible.  A  completely  centralized  organization  would  require 
a  capacity  for  storage  and  processing  of  technological  and  other 
information  that  exceeds  anything  likely  to  be  available.  The 
competitive  process,  on  the  contrary,  achieves  decentralization*) 
At  each  stage  in  the  market's  process  of  successive  approxima- 
tions, any  individual  firm  adjusts  its  tentative  production  plans 
making  use  of  information  only  about  the  current  tentative 
prices  and  its  own  technology.  The  adjustments  of  tentative 
prices,  at  the  same  time,  depend  only  on  the  aggregate  demands 
and  supplies.  These  are  simply  a  sum  of  the  tentative  production 
plans  of  the  individual  firms  (and  consumption  plans  of  con- 
sumers) plus  the  originally  existing  supplies  of  basic  resources. 

Thus  the  information  needed  by  firms  and  consumers  consists 
solely  of  their  technologies  or  utility  functions  plus  prices,  while 
the  adjustment  of  prices  is  based  only  on  the  aggregate  of  in- 
dividuals' decisions.  It  is  the  minimization  of  information 
requirements  for  each  participant  in  the  economy  which  con- 
stitutes the  virtue  of  decentralization. 

Our  aim  here  is  to  state  more  precisely  than  hitherto  the 

2)  V.  Pareto,  Manuel  a"  £conomie  Politique  (Deuxieme  Edition,  Paris:  Marcel 
Giard  1927),  pp.  233 — 34.  In  the  terminology  of  cybernetics,  Pareto,  crystal- 
lizing the  usual  views  of  economists,  has  held  that  the  market  mechanism  is 
homeostatic,  a  position  strongly  denied  by  the  founder  of  cybernetics;  see  N. 
Wiener,  Cybernetics  (New  York  and  Paris:  John  Wiley  and  Sons,  and  Hermann 
«t  Cie.,  1948),  pp.  185  —  86.  The  comparison  between  the  market  and  a  com- 
puting machine  has  also  been  made  by  R.  M.  Goodwin,  "Iteration,  Automatic 
Computers,  and  Economic  Dynamics,"  Metroeconomica,  III  (1951),  1 — 7. 

3)  For  a  possible  definition  of  the  term  "decentralization"  in  this  context, 
see  L.  Hurwicz,  "Decentralized  Resource  Allocation,"  Cowles  Commission 
Discussion  Paper  No.  2112,  1955.  See  also  the  first  paragraph  of  p.  77  below. 
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dynamic  system  which  is  implied  by  the  market  mechanism, 
to  give  conditions  under  which  the  resource  allocation  determined 
by  it  converges  to  the  optimal,  where  optimality  is  defined  in 
terms  of  a  single  utility  function  for  the  economy,  and  to  suggest 
modifications  of  the  market  mechanism  which  still  preserve 
some  degree  of  decentralization  for  cases  where  the  conditions 
in  question  are  not  satisfied.  The  conditions  for  convergence 
of  the  unmodified  market  mechanism  are  basically  those  of 
diminishing  or  constant  returns  in  production  and  diminishing 
marginal  utility  (in  a  generalized  sense)  for  the  consumption  of 
final  demands.  To  study  these  dynamic  problems,  we  will  make 
use  of  a  variety  of  mathematical  tools,  some  of  which  have  arisen 
in  the  theory  of  games  and  of  linear  and  nonlinear  programming 
and  some  of  which  are  more  classical  applications  of  differential 
equations  to  maximization  problems. 

In  section  B  of  this  part,  we  will  review  the  history  of  the 
problem  of  dynamic  adjustment  to  a  social  optimum  from  its 
initial  formulation  by  Walras.  In  section  C,  we  will  state  more 
precisely  our  resource  allocation  model.  In  part  II,  the  mathe- 
matical tools  that  will  be  used  are  reviewed;  they  are  primarily 
the  gradient  method  (or  method  of  steepest  ascent),  which  is 
a  system  of  differential  equations  for  solving  a  maximization 
problem,  and  the  relation  between  constrained  maxima  and  saddle 
points.  In  part  III,  we  study  the  case  of  production  under 
diminishing  or  constant  returns  with  a  strictly  concave  utility 
function  (see  Definition  1  below;  the  condition  is  a  generalized 
form  of  diminishing  marginal  utility).  In  this  case  two  dynamic 
systems  which  formalize  the  intuitive  notion  of  a  market 
mechanism  have  been  shown  to  converge.  In  part  IV,  we 
consider  the  cases  where  unmodified  market  processes  do  not 
converge,  in  particular,  that  of  increasing  returns.  Here  three 
modified  market  mechanisms  are  proposed  and  shown  to  have 
some  desirable  properties.  One  such  mechanism  is  closely  related 
to  imperfect  competition,  another  to  speculation.  It  is  also  shown 
that  the  purely  linear  case  (linear  utility  function  and  constant 
returns  to  scale),  for  which  the  unmodified  market  mechanisms 
do  not  converge,  can  be  completely  solved  by  some  of  the 
modified  mechanisms. 
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B.  Historical  Remarks. 

Most  of  the  discussions  in  the  economic  literature  on  the 
achievement  of  a  social  optimum  through  the  market,  whether 
in  a  socialist  or  a  capitalist  economy,  have  contented  themselves 
with  a  static  characterization.4)  The  main  contributors  to  a 
dynamic  formulation  of  the  market  mechanism  have  been  Walras, 
Pareto,  Taylor5,)  Lange,6)  and  Samuelson.7) 

In  the  present  study,  we  are  concerned  with  a  single  consumer 
in  whose  interests  the  whole  economy  is  run.  Walras,  Pareto, 
Taylor,  and  Samuelson  are  all  concerned  with  a  multiplicity  of 
consumers;  Lange  deals  with  both  cases.  In  the  many-consumer 
case,  it  is  assumed  that  the  consumer  maximizes  his  utility 
instantaneously,  so  that  his  demand  for  commodities  is  a  given 
function  of  prices. 

Walras  was  not  explicitly  concerned  with  a  socialist  economy, 
but  he  did  regard  the  competitive  system  as  a  computing  device 
for  achieving  a  maximum  of  satisfaction  to  society.  On  the 
production  side,  he  assumed  that  all  production  processes  were 
linear-homogeneous.8)  In  this  case,  the  profit  is  proportional 
to  the  scale,  at  any  given  set  of  prices,  and  the  marginal  profita- 
bility is  simply  a  constant.  Walras*  rule  for  adjustment  when  the 

4)  See  for  example  A.  P.  Lerner,  The  Economics  of  Control  (New  York: 
Macmillan,  1946)  or  J.E.  Meade,  Planning  and  the  Price  Mechanism  (London: 
George  Allen  and  Unwin,   1948). 

5)  F.  M.  Taylor,  "The  Guidance  of  Production  in  a  Socialist  State,"  American 
Economic  Review,  Vol.  19  (1929),  No.  1,  reprinted  in  B.  Lippincott  (ed.),  On 
the  Economic  Theory  of  Socialism  (Minneapolis:  University  of  Minnesota  Press, 
1938),  pp.  41 — 54,  particularly  pp.  50  —  54. 

6)  O.  Lange,  "On  the  Economic  Theory  of  Socialism,"  Review  of  Economic 
Studies,  IV,  Nos.  1  and  2  (1936-7)  reprinted  in  B.  Lippincott,  ibid.,  pp.  57 — 142, 
particularly  pp.  70  —  98. 

7)  P.  A.  Samuelson,  Foundations  of  Economic  Analysis  (Cambridge,  Mass.: 
Harvard  University  Press,   1947)  pp.  269—75. 

8)  We  follow  the  discussion  in  Walras,  op.  cit.,  Lesson  22,  which  treats  the 
adjustment  process  when  there  are  both  consumers  and  producers.  In  this 
lesson,  it  is  also  assumed  that  there  is  only  one  output  for  each  process  and 
only  one  process  for  producing  each  commodity  (i.e.,  the  assumption  of  fixed 
production  coefficients),  but  this  assumption  is  irrelevant  to  our  purposes. 
Later,  Walras  considers  the  production  coefficients  not  as  given  but  as  derived 
by  minimization  of  costs  with  a  given  production  function  (pp.  383 — 86); 
but  he  never  successfully  integrates  this  discussion  with  the  earlier  discussion 
of  adjustment,  as  Jaffe  points  out  (pp.  552 — 53). 
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economic  system  is  in  an  initial  state  of  disequilibrium  is  that 

(a)  the  price  change  has  the  same  sign  as  the  excess  demand,  and 

(b)  the  change  in  the  scale  of  each  process  has  the  same  sign  as 
the  marginal  profitability.9) 

Warms'  adjustment  rule  for  firms  is  an  inevitable  consequence 
of  his  assumption  of  constant  returns  to  scale.  We  cannot 
postulate  that  a  firm  will  instantaneously  maximize  its  profits 
at  any  given  level  of  prices;  under  constant  returns,  the  profit- 
maximizing  scale  may  be  infinite,  if  positive  profits  are  possible 
at  some  level,  or  it  may  be  that  there  are  zero  profits  at  all 
scales,  in  which  case  profit-maximization  does  not  define  the 
behavior  of  the  firm,  or,  if  profits  are  negative  at  all  positive 
scales,  the  optimal  scale  is  zero.  Walras  correctly  saw  that 
under  these  circumstances  he  could  not  prescribe  instantaneous- 
profit-maximization  in  the  way  he  did  require  instantaneous 
utihty-maximization  by  the  consumer.  Despite  the  necessary 
character  of  lagged  adjustment,  Walras'  successors  were  usually 
not  so  careful,  and  a  clearer  understanding  of  the  special  adjust- 
ment problems  connected  with  constant  returns  to  scales  was 
not  achieved  until  the  development  of  game  theory  and  linear 
programming  brought  them  to  the  fore. 

Pareto  explicitly  noted  that  a  socialist  system  would  have  to 
mimic  the  process  of  competition  to  achieve  an  optimal  allo- 

9)  Walras,  op.  cit.,  pp.  253 — 54;  the  terminology  has  been  changed  to  conform 
to  that  used  below.  The  reader  should  be  warned  that  Walras'  presentation 
is  by  no  means  unequivocal,  and  the  interpretation  is  not  the  only  one  possible. 
We  have  followed  the  statement  which  summarizes  the  lesson  (22)  on  taton- 
nements  in  an  economy  with  consumption  and  production.  But  the  preceding 
ten  pages,  if  taken  literally,  present  an  adjustment  process  only  distantly 
related  to  the  summary.  Some  prices  are  held  constant  while  others  vary 
independently  to  clear  different  markets;  similarly  quantities  are  adjusted  to 
make  profits  zero  in  each  industry,  but  the  adjustment  process  described  takes 
all  prices  other  than  the  selling  price  as  given.  It  is  hard  to  believe  that  Walras 
meant  to  describe  reality  as  if  markets  and  firms  came  into  equilibrium  in  a 
preassigned  order.  It  is  perhaps  this  difficulty  which  led  Goodwin  {op.  cit., 
p.  5)  to  deny  that  Walras  meant  his  adjustment  process  to  be  practical,  as 
describing  either  reality  or  a  device  for  the  operation  of  a  socialist  society. 
However  we  feel  that  Walras  suffered  in  his  exposition  from  the  crudity  of  his 
mathematical  tools;  he  was  seeking  to  explain  a  simultaneous  adjustment  in 
many  markets  and  within  many  firms  without  using  the  concept  of  a  system 
of  differential  (or  difference)  equations  and  hence  was  forced  to  resort  to  a  crude 
formulation  in  which  some  variables  changed  while  others  are  held  constant. 
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cation10)  and,  following  Walras,  laid  great  emphasis  on  the 
market  as  a  computing  device  to  solve  the  system  of  equations 
of  general  economic  equilibrium.  However,  his  description  of  the 
dynamics  of  adjustment  is  considerably  less  precise  than  Walras'. 
In  the  Cours  (published  in  1896)  he  indicated  rather  sketchily 
his  agreement  with  Walras'  description  of  the  competitive 
process11)  and,  while  agreeing  with  Edge  worth  that  computa- 
tionally there  are  many  ways  of  achieving  an  economic  optimum, 
contends  that  Walras'  way  is  the  natural  economic  way.12) 

The  description  in  the  later  Manuel  is  more  extensive  but  more 
obscure.  The  discussion  of  stability  implies  a  dynamic  system  in 
which  price  responds  to  a  difference  of  supply  and  demand.13) 
However,  the  meaning  of  the  supply  functions  for  firms  is  never 
clearly  defined.  For  those  operating  under  diminishing  returns, 
it  is  the  ordinary  profit-maximization  rule;  for  others,  it  appears 
rather  to  be  the  rule  that  price  equals  average  cost.14)  The 
justification  for  the  latter  however  is  based  on  free  entry,  which 
is  itself  a  dynamic  process  akin  to  Walras',  and  should  have  been 
introduced  explicitly. 

Taylor  distinguished  between  primary  factors  and  produced 
commodities.  For  any  given  set  of  prices  of  the  former,  the  price 
of  the  latter  are  set  so  that  price  equals  cost  of  production.15) 
The  demand  by  consumers  at  these  prices  indirectly  generates 
a  demand  for  primary  factors.  The  prices  of  these  are  then 
increased  if  demand  exceeds  supply,  decreased  in  the  opposite 
case.16) 

Taylor's  rule  for  price-setting  by  producers  is  anything  but 
clear.  In  the  special  case  where  constant  returns  to  scale  prevail, 
there  is  no  joint  production,  and  each  firm  produces  a  final 
product  directly  from  primary  factors,  Taylor's  rule  is  unequi- 

10)  Manuel,  op.  cit.,  pp.  362 — 64.  There  is  an  amusing  misprint,  where  he 
speaks  of  a  collectivist  society  "qui  ait  pour  but  de  procurer  a  ses  membres 
le  minimum  d'ophelimite"   (p.  362,  italics  added). 

11 )  Cours  d'economie  politique  (Tome  Premier,  Lausanne,  Paris,  and  Leipzig: 
F.  Rouge,  Pichon,  and  Duncker  and  Humblot,   1896),  pp.  45  —  47. 

12)  Ibid.,  p.  25. 

13)  Manuel,  pp.  223—24,  232—33. 

14)  Ibid.,  pp.   177—79,   185—87. 
16)  Taylor,  op.  cit.,  p.  45. 

16)  Taylor,  op.  cit.,  p.  53. 
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vocal.  However,  in  the  absence  of  constant  returns,  average  costs 
will  depend  upon  a  system  of  simultaneous  equations  involving 
the  demand  functions  among  others.  Even  with  constant  returns, 
the  setting  of  prices  by  Taylor's  rule  require  solution  of  a  system  of 
simultaneous  equations  if  products  of  some  firms  are  used  by 
others.  If  there  are  alternative  processes  for  producing  the 
same  commodity,  there  would  also  be  a  minimization  problem 
to  solve.17)  No  indication  is  given  as  to  the  solution  of  these 
problems;  certainly  there  would  have  to  be  some  degree  of 
centralization,  in  the  sense  that  a  central  productive  agency 
would  have  to  have  access  to  all  the  technical  coefficients,  at 
least,  in  order  to  determine  prices. 

Lange  has  been  the  strongest  defender  of  the  proposition  that 
a  socialist  economy  can  achieve  both  an  optimal  allocation  of 
resources  and  the  computational  and  informational  virtues  of 
decentralization.  He  presents  two  adjustment  models,  ac- 
cording as  there  is  or  is  not  consumers'  sovereignty.  In  both 
cases,  the  behavior  of  firms  is  defined  by  the  principle  of  marginal- 
cost  pricing.  That  is,  at  given  prices,  the  firm  is  supposed  first 
to  find  for  any  given  output  the  minimum  cost  of  producing  it. 
The  output  is  then  determined  so  as  to  equate  marginal  cost,  so 
calculated,  to  price.  The  demand  functions  for  factors  are 
determined  by  the  cost-minimization  criterion. 

This  formulation  is  clearly  designed  to  encompass  the  cases  of 
both  increasing  and  decreasing  returns.  In  the  latter  case,  it 
corresponds  in  general  to  the  single  rule  of  choosing  inputs  and 
outputs  so  as  to  maximize  profits.  As  we  shall  see  below,18  the 
rule  is  inadequate  and  strictly  speaking  incorrect  for  the  case  of 
increasing  returns.  Further,  the  rule  clearly  does  not  meet  the 
problems  raised  by  the  case  of  constant  returns,  as  sketched 
above. 


17 )  If  there  is  no  joint  production,  then  the  efficient  choice  of  processes  will 
be  the  same  for  all  demand  functions.  See  T.  C.  Koopmans,  ed.,  Activity  Analysis 
of  Production  and  Allocation,  Cowles  Commission  Monograph  No.  13  (New 
York  and  London:  John  Wiley  &  Sons  and  Chapman  &  Hall,  1951),  Chs. 
VII — X,  pp.  142 — 73.  If  there  is  joint  production,  then  the  determination 
of  the  optimal  set  of  process  and  therefore  of  prices  according  to  Taylor's  rule 
cannot  be  made  independently  of  the  demand  function. 

18)  See  part  IV,  section  C. 
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The  firm's  behavior  defines  the  supply  and  demand  functions 
of  firms.  In  Lange's  first  model,  the  demand  functions  of  con- 
sumers are  defined  by  utility  maximization.  The  supply  and 
demand  functions  for  the  whole  market  now  being  defined,  the 
state  is  to  vary  prices  in  accordance  with  supply  and  demand. 
Lange  recognized  that  it  had  not  been  proved  that  this  dynamic 
process  would  necessarily  converge  to  the  equilibrium  which 
corresponded  to  optimal  resource  allocation;  the  process  might 
not  converge  to  anything  at  all  but  instead  oscillate  indefinitely 
or  even  diverge  explosively.19)  He  suggests  that  the  price- 
adjustment  rules  of  the  state  might  have  to  be  modified  in  some 
way  to  avoid  oscillations,  as  by  taking  account  of  the  anticipated 
effects  of  price  changes  on  quantities,  a  point  which  is  illustrated 
below  for  the  case  of  increasing  returns.20) 

Lange's  discussion  of  the  second  model,  where  there  is  only 
one  utility  function,  that  of  a  Central  Planning  Board,  does  not 
clearly  define  the  dynamic  process.21)  The  meaning  seems  to 
be  that  the  Central  Planning  Board  determines  its  demand 
functions  from  its  utility  function  as  if  it  were  a  consumer  and 
then  prices  respond  to  supply  and  demand,  as  before.  It  is  not 
clear  whether  the  Board  is  subjected  to  a  budget  limitation  or 
it  simply  equates  marginal  utility  to  price. 

Samuelson's  well-known  restatement  of  the  Walrasian  dynamic 
system  assumes  that  supply  and  demand  functions  are  well- 
defined.  His  simultaneous  dynamic  system  is  then  simply  the 
statement  that  the  rate  of  change  of  each  price  is  proportional 
to  the  difference  between  demand  and  supply.  Though  the 
essentials  of  the  system  are  found  in  Walras  and  Lange,  it  is 
Samuelson  who  has  first  made  it  explicit,  and  doubtless  our 
reading  of  the  first  two  is  done  through  the  spectacles  supplied 
by  Samuelson. 

The  development  of  linear  programming  put  renewed  em- 
phasis on  the  problems  raised  by  constant  returns  in  production. 
In  a  purely  linear  economy,  where  each  process  operates  under 
constant  returns  and  the  utility  function  of  the  economic  system 


19)  Lange,  op.  tit.,  fn  43,  pp.  89—90. 

20)  See  part  IV,  parts  D,  E,  and  F. 

21 )  Lange,  op.  cit.,  pp.  90—93. 
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is  linear,  the  system  of  computations  analogous  to  the  market 
led  to  indefinite  oscillations,  as  was  shown  by  Samuelson.22) 
Indeed,  this  problem  was  the  starting  point  of  the  present 
investigation  for  the  authors. 

C.  Formal  Statement  of  the  Resource  Allocation  Model. 

Let  there  be  s  commodities  and  m  processes  for  carrying  on 
production  of  commodities.  Each  process  may  be  carried  on 
at  different  scales,  and  it  is  not  supposed  in  general  that  there 
are  constant  returns  to  scale.  Let  xi  be  the  scale  of  the  jth  process; 
for  most  purposes,  Xj  may  most  conveniently  be  thought  of  as 
the  amount  of  output  of  the  jth  process,  or  an  index  of  outputs 
if  the  process  has  more  than  one.  Let  gw(jty)  be  the  amount  of 
commodity  i  produced  by  the  jth  process  when  the  latter  is 
conducted  at  scale  x^  a  negative  value  for  gti(xt)  refers  to  an 
input. 

Among  the  s  commodities  we  will  distinguish  a  sub-class  of 
n  desired  commodities  which  enter  into  final  uses.  The  remaining 
or  primary  commodities  are  useful  only  because  they  enter  into 
the  production  of  the  desired  commodities  directly  or  indirectly. 
Let  yt  be  the  amount  of  desired  commodity  i  used  by  the  economy 
for  final  demands  —  that  is,  not  used  up  in  one  of  the  productive 
processes.  The  economy  is  assumed  to  possess  a  single  utility 
function  which  has  as  variables  the  final  demands  for  the 
desired  commodities  —  that  is,  a  function, 

(1)  U(yi yn). 

The  total  output  of  commodity  i  by  the  productive  processes  is 
Sml  gij(xj)-  In  addition,  some  commodities  (particularly  natural 
resources  and  labor)  are  available  in  positive  quantities  without 
production.  Let  Si  be  the  amount  of  commodity  i  available 
initially;  Si  will,  of  course,  be  zero  for  most  commodities.  A 
primary  commodity  for  which  £,-  =  0  is  usually  referred  to  as 
an  intermediate  commodity,  but  we  shall  not  need  to  make  a 

22)  In  an  unpublished  memorandum  of  1950;  see  R.  Dorfman,  P.  A.  Samuelson, 
and  R.  Solow,  Linear  Programming  and  Economic  Analysis  (New  York,  Toronto, 
and  London:  McGraw-Hill,  1958),  fn.  1,  p.  63.  An  example  is  given  in  part  II, 
section  D. 
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distinction  between  primary  and  intermediate  commodities. 
In  order  for  a  given  set  of  final  demands  ylf  .  .  .,  yn  to  be  feasible, 
it  is  necessary  that  they  do  not  exceed  the  total  available  from  the 
productive  sector  plus  the  initial  availabilities;  it  is  also  necessary 
that  the  total  output  of  the  primary  commodities  plus  that 
initially  available  be  at  least  zero.  Hence, 

m 

(2.1)  y«^2*«(*,)+ft  [i=l,...,n), 

3=1 

m 

(2.2)  o^2s«fo)'+*«     (*  =  »  +  i.;.;,s). 

3=1 

Here  we  designate  the  desired  commodities  as  1,  .  .  .,  n  and  the 
primary  commodities  as  n  +  1,  .  .  .,  s.  The  resource  allocation 
problem  is  then  to  choose  yv  .  .  .,yn  and  xlt  .  .  .,  xm  so  as  to 
maximize  U(yv  .  .  .,  yn)  among  all  sets  of  variables  which  satisfy 
the  feasibility  conditions  (2). 

It  is  to  be  noted  that  we  have  assumed  the  absence  of  external 
economies  and  diseconomies  (as  between  processes),  since  the 
inputs  and  outputs  gu  do  not  depend  upon  the  scale  of  any 
other  process  than  the  jth. 

We  have  presented  the  production  sector  of  the  resource 
allocation  model  in  the  form  of  processes  rather  than  the  more 
usual  production  functions.  In  this,  we  follow  Koopmans23) 
who  argues  persuasively  that  the  latter  is  a  derived  concept 
which  already  implies  some  elements  of  optimization.  The 
formulation  in  terms  of  processes  is  also  preferable  if  the  model 
is  interpreted  as  referring  to  optimization  within  a  large  firm. 
However,  we  generalize  Koopmans'  model  by  admitting  non- 
linear functions  gtj,  so  that  the  effects  of  non-constant  returns 
to  scale  can  be  studied. 

For  future  reference,  note  that  the  variables  yv  .  .  .,  yn, 
xv  .  .  .,  xm  are  necessarily  non-negative  from  their  very  defini- 
tion. 


28)  T.  C.  Koopmans,  "Analysis  of  Production  as  an  Efficient  Combination 
of  Activities,"  ch.  Ill  in  Activity  Analysis,  pp.  33  —  97,  especially  pp.  33 — 34. 
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II.  MATHEMATICAL  BACKGROUND 

A.  Some  Notes  on  Unconstrained  Maxima. 

I.  Necessary  Conditions. 

The  numbers  zlt  .  .  .,  zp  form  the  (unconstrained)  maximum  of 
the  (real  valued)  function/^,  . . .,  zp)  iif(zv  . . .,  zv)  ^f(zv  . . .,  zp) 
for  all  possible  combinations  of  values  of  zlf  .  .  .,  zp.  It  is  per- 
missible to  think  of  the  z/s  as  being  restricted  to  non-negative 
values,  if  appropriate. 

For  an  economic  example  of  unconstrained  maximization  in 
the  present  model,  consider  the  manager  of  a  single  process, 
say  the  /th,  who  buys  and  sells  the  commodities  at  prices 
Pit  •  •  '*  fin  which  are  given  to  him.  His  profit,  then,  is 

n 

(3)  ns(xt)  =2pigii(xi)- 

i=i 

In  a  competitive  world,  the  manager  seeks  to  choose  xj  to 
maximize  (3).  Of  course,  this  is  an  example  where  there  is  only 
one  variable. 

We  are  not  primarily  interested  in  unconstrained  maxima  as 
such,  but  a  brief  discussion  of  some  points  connected  with  them 
will  serve  to  illustrate  developments  in  the  more  complicated 
case  of  constrained  maxima  with  which  we  are  more  concerned. 

It  should  first  be  noted  that  an  arbitrary  function  need  not 
have  a  maximum.  For  example,  in  (3),  if  each  of  the  functions 
gij(Xj)  is  linear,  then  so  is  %(#,)  (recall  that  the  prices  j>i  are 
taken  as  given  numbers,  not  variables  as  far  as  the  process 
manager  is  concerned).  If  the  linear  function  TZjfa)  has  a  positive 
slope,  then  n$  can  be  made  as  large  as  desired  by  making  in- 
sufficiently large,  so  that  no  finite  value  can  be  designated  as 
the  maximum.  Even  a  non-linear  function  need  not  have  a 
maximum. 

Second,  it  should  be  noted  that  a  function  may  have  more 
than  one  maximum.  An  extreme  case  is  a  constant,  for  which 
all  points  are  maxima;  thus  if  the  profit  function  %(&;)  is  every- 
where zero,  a  situation  which  arises  in  linear  programming,  all 
values  of  xs  maximize  it. 

Third,  we  have  defined  a  maximum  with  respect  to  all  per- 
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missible  variations  in  the  variables,  sometimes  referred  to  as  a 
global  maximum.  It  is  frequently  useful  to  use  the  weaker  concept 
of  a  local  maximum,  a  set  of  numbers  zlt  .  .  .,  z9  such  that 
f(zv  .  .  .,  zv)  ^  f(zv  .  .  .,  zv)  for  possible  values  of  the  z/s  in  a 
small  neighborhood  of  zv  .  .  .,  zp.  A  global  maximum  is  clearly, 
a  local  maximum  but  not,  in  general,  conversely.  In  this  work; 
the  term,  "maximum,"  when  unqualified,  will  refer  to  a  global 
maximum. 

If  the  function  /  is  differentiate  and  has  a  local  maximum 
and  the  variables  zt  are  unrestricted  as  to  sign,  then,  as  is  well 
known,  the  partial  derivatives  d//ctet-  =  0  for  all  i  =  1,  .  .  .,  p 
when  evaluated  at  the  maximum.  We  shall  use  the  notation 
fz  for  dfjdzt  in  general  and  Jz  for  /  evaluated  at  zv  .  .  .,  zp.. 
Then  a  necessary  condition  for  a  local  (and  hence  for  a  global) 
maximum  when  the  z/s  are  not  restricted  as  to  sign  is  that 
fSi=0  (i=l,...,p). 

This  last  condition  is,  of  course,  not  sufficient  even  for  a 
local  maximum,  let  alone  a  global  one.  The  condition  also  holds 
at  minima  and  indeed  at  some  points  which  are  neither  maxima 
nor  minima. 

2.  Concave  Functions  and  Sufficient  Conditions. 

The  condition  that  the  derivatives  be  zero  does  become  a, 
sufficient  condition  for  a  global  maximum  if  the  function 
f(zv  .  .  .,  zv)  is  restricted  to  a  special  class,  known  as  concave 
functions.  If  there  is  only  one  variable,  a  concave  function  is  one 
in  which  the  slope  is  never  increasing  as  the  variable  increases. 
A  function  of  one  variable  is  said  to  be  strictly  concave  if  the 
slope  is  decreasing;  such  a  function  is  illustrated  in  Figure  1. 
A  concave  function  which  is  not  strictly  concave  differs  only  in 
that  the  graph  might  have  linear  segments.  It  is  easy  to  see 
graphically  that  the  point  z  at  which  the  derivative  is  zero  is 
indeed  the  maximum. 

In  Figure  1  it  can  be  seen  that  if  we  take  two  points,  such  as 
z  and  z\  the  part  of  the  graph  between  the  two  points  lies  above 
the  line  segment  joining  them.  This  property  can  serve  to  yield 
definitions  of  concavity  and  strict  concavity  for  functions  of  any 
number  of  variables. 
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Definition  1.  The  (real- valued)  function  f(zv  .  .  .,  zv)  is  said 
to  be  concave  if  for  any  two  points  (zlf  .  .  .,  zp)  and  (z'v  .  .  .,  z'p) 
and  any  third  point  {?[',  .  .  .,zp)  such  that  z"  =  dzt  -f-  (1  —  0)^ 
(*'  =  1,  .  .  .,  j>),  where  6  is  a  real  number  between  0  and  1, 
/(*!-,  •  •  .,  <)  ^  ^  /fo  •  •  ..  z9)  +  (1  -  0)/(4  .  .  .,  <).  The  func- 
tion  f(zv  .  .  .,  zv)  is  said  to  be  strictly  concave  if  the  strict  inequality 
holds  in  the  last  statement. 

Note  that  a  linear  function  is  concave  but  not  strictly  concave. 

In  the  case  of  a  single  process  in  a  competitive  world,  where 
the  function  (3)  is  to  be  maximized,  then  the  condition  dni\dxj  =  0 
is  both  necessary  and  sufficient  for  maximization  if  the  function 
nj(Xj)  is  concave  and  differentiable.  That  is,  the  point  of  maxi- 
mum profit  is  that  for  which  the  marginal  profitability  is  zero.  (If 
the  function  n^Xj)  is  not  concave,  then  this  need  not  be  true; 
in  that  case,  the  point  where  the  marginal  profitability  is  zero 
might  conceivably  be  a  point  of  minimum  profit.)  Suppose 
now  that  each  of  the  functions  g^fa^  is  concave;  then  it  is  easy 
to  see  that,  since  the  prices  ftt  are  presumably  non-negative, 
the  function  nj(xj)  is  also  concave.  Thus  the  concavity  of  the 
functions  &,■(%)  insures  that  the  condition  of  zero  marginal 
profitability  (price  equals  marginal  cost)  is  equivalent  to  profit 
maximization. 
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What  is  the  meaning  of  the  condition  that  the  functions 
gijixj)  are  concave?  For  simplicity,  assume  that  the  process  has 
only  one  output,  say  commodity  1,  and  that  the  scale  of  the 
process  is  measured  by  that  output  so  that  g^fo)  =  xi  (note 
that  this  function,  being  linear,  is  certainly  concave).  Since  the 
other  commodities  are  inputs,  g^(#3)  is  the  negative  of  the  input 
of  commodity  i  for  i  >  1.  Since  also  gw(0)  =  0  (for  zero  output, 
zero  inputs  suffice),  it  is  easy  to  see  that  concavity  of  the 
function  gw(sfy)  means  that  the  input  of  the  ith  commodity 
increases  at  least  as  rapidly  as  the  output,  which  is  the  same 
as  non-increasing  returns.  Strict  concavity  would  imply  diminish- 
ing returns,  while  the  requirement  of  concavity  alone  permits 
constant  returns. 

3.  The  Gradient  Method. 

Now  consider  the  problem  of  finding  an  unconstrained  maxi- 
mum by  a  process  of  successive  approximations,  starting  from 
some  given  first  approximation,  say  z\,  .  .  .,  zj.  One  possible 
procedure  is  to  vary  each  coordinate  separately  in  such  a  way 
as  to  increase  the  function  f(zv  .  .  .,  zp).  Thus,  suppose  dfjdzt  >  0 
at  the  initial  point.  Then  it  is  reasonable  to  increase  zt  somewhat; 
the  contrary  would  be  true  if  dfjdzt  <  0.  The  same  reasoning 
applies  to  each  coordinate.  Thus  a  reasonable  process  would  call 
for  picking  a  new  point  z{,  .  .  .,  zp  in  such  a  way  that, 

(4)  z\—z\  has  the  same  sign  as  fz(z\t  . . .,  z°p)  for  each  *"==  1,  . . .,  p. 

The  same  procedure  can  be  applied  again  starting  with  z\,  .  .  .,  z\\ 
indeed,  such  repetition  is  the  essence  of  an  iterative  procedure. 
Thus,  at  the  tih  step, 

(5)  zi£rX—z\  has  the  same  sign  as  fz{z\,  . . .,  z*p)  for  each  i=  1, . . .,  p. 

The  method  specified  by  (5)  is  a  method  of  finite  differences; 
mathematically,  it  is  simpler  to  assume  that  the  process  of 
adjustment  takes  place  continuously  rather  than  in  the  small 
steps  implied  in  (5).  To  effect  this,  the  finite  difference  zf'1— z\ 
should  be  replaced  by  the  corresponding  derivative  dzjdt, 
that  is,  the  rate  of  change  of  the  ith  coordinate  with  respect  to 
(computational)  time.  The  requirement  in  (5)  is  then  replaced 
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by  the  condition 

(6)  dzjdt  has  the  same  sign  as  fZf{zv  .  .  .,  zv). 
Finally,  the  simplest  way  to  insure  that  (6)  holds  is  to  require24) 

(7)  dzjdt  =  k<  (dfldzt)     (i  =  1,  .  .  .,  p), 

where  ki  is  a  positive  constant.  Relation  (7)  defines  a  system  of 
differential  equations;  the  solution  of  this  sytem  defines  each 
coordinate  as  a  function  of  time,  and  this  is  a  description  of  the 
adjustment  process.  We  shall  refer  to  (7)  as  the  gradient  method.2^) 
The  constant  kt  is  the  adjustment  speed. 

It  is  easy  to  see  that  we  can  assume  kt  =  1  without  loss  of 
generality  simply  by  changing  the  units  in  which  zi  is  meas- 
ured. For  let  z.  =  a^;  then  dzjdt  —  atdzjdt,  while  dfjdz.  = 
{\jai){dfldzi))  so  that  (7)  becomes 

dz'ildt  =  a*ki(dfldzi). 

We  can  choose  at  so  that  c?iki  =  1.  In  this  form,  with  which  we 
shall  be  mostly  concerned,  the  gradient  method  becomes 

(8)  dzjdt  =  df/dzt. 

For  this  method  to  be  acceptable,  it  is  necessary  that  the  solution 
of  (8)  converge  to  the  maximum  values  zv  .  .  .,  z v.  If  we  look 
again  at  Figure  1,  it  is  intuitively  obvious  that  the  process  (8) 
will  indeed  converge  to  z ;  for  if  the  starting  point  z°  is  below  z, 
the  rule  (8)  calls  for  increasing  z  up  to  the  point  z  where  the 
derivative  is  zero  but  not  going  beyond.  In  general,  we  can  make 
the  following  statement: 

(9)  If  f(zv  . . .,  Zp)  is  a  strictly  concave  function,  then  the  gradient 
process  (8)  converges  to  the  maximum  point  (zv  .  . .,  zv). 

24)  fz^v  •  •  ■>  zp)  and  dfjdZi  have  the  same  meaning. 

25)  See  H.  B.  Curry,  "The  Method  of  Steepest  Descent  for  Non-Linear 
Maximization  Problems,"  Quarterly  of  Applied  Mathematics,  Vol.  2  (1944), 
258  —  61,  who  discusses  finite  difference  methods;  C.  B.  Tompkins,  "Methods 
of  Steep  Descent,"  ch.  18  in  E.  F.  Beckenbach,  ed.,  Modern  Mathematics  for 
the  Engineer  (New  York:  McGraw-Hill,  1956),  pp.  448  —  79.  A  brief  discussion 
is  also  found  in  Samuelson,  Foundations,  pp.  301  —  2.  Strictly  speaking,  (7)  is 
only  a  special  case  of  the  gradient  method  (sometimes  called  the  method  of 
steepest  ascent),  but  since  it  is  the  only  form  with  which  we  shall  be  concerned, 
no  confusion  will  result. 
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(The  requirement  of  strict  concavity  instead  of  concavity  is  not 
basic,  but  the  statement  of  (9)  would  have  to  be  more  com- 
plicated otherwise.) 


*-  z 


Fig.  2 

If  f(zlf  .  .  .,  zp)  is  not  concave,  we  might  have  the  situation 
of  Figure  2;  there,  if  the  starting  point  z°  is  below  z*  or  between 
z*  and  z,  the  process  will  converge  to  the  local  maximum  z* 
rather  than  to  the  global  maximum.26 

4.  A  Limiting  Form  of  the  Gradient  Method 

We  can  imagine  in  equation  (7)  that  some  of  the  adjustment 
speeds  k{  approach  infinity.  This  is  equivalent  to  saying  that 
some  of  the  variables,  say  zlt  .  .  .,  zr,  are  adjusted  instantaneously 
to  values  which  maximize  /  for  given  values  of  the  remaining 
variables  zr+1,  .  .  .,  zp.  This  may  be  meaningful  in  situations 
where  the  function  f(zv  .  .  .,  zp)  is  of  such  a  simple  form  with 
respect  to  zv  .  .  .,  zr  that  it  is  computationally  practical  to  find 
the  maximum  with  respect  to  those  variables  by  some  fairly 
direct  method.  The  other  variables  z, 


r+l> 


,  zP  are  varied  in 
accordance  with  equations  (7),  which,  by  a  suitable  choice  of 
unit,  can  be  written  in  the  form  (8),  as  has  already  been  ex- 
plained. The  adjustment  process  then  has  the  form, 

(10)  zv  ...,zr  maximize  f(zv  . . .,  zr>  zr+1, . . .,  zp)  for  given  zr+1>. . .,  zp; 


26)  The  gradient  process  for  unconstrained  maxima  will  converge  in  general 
to  some  point  at  which  all  derivatives  are  zero. 
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(11)  dzjdi  =  df/dZi,    i  =  r+l,...,p. 

As  the  variables  zr+1,  .  .  .,  zp  change,  the  values  of  zlf  .  .  .,  zr 
which  maximize  f(z1,  .  .  .,  zp)  will  usually  change  too,27)  so  that 
all  the  variables  are  actually  changing  in  the  process.  If  we  as- 
sume that  the  function  f(zv  .  .  .,  zp)  is  concave  in  all  the  variables, 
it  is  necessarily  concave  in  zv  .  .  .,  zr,  so  that  (10)  is  equivalent  to 

(12)  dfjdz,  =  0,     *  =  1,  /.  .,  r. 

The  adjustment  process  defined  by  (11)  and  either  (10)  or  (12) 
sounds  very  reasonable,  at  least  if  the  equations  (12)  are  com- 
putationally practical  to  solve.  The  assumption  that  in  a  dynamic 
system  some  variables  adjust  slowly  while  other  variables  adjust 
virtually  immediately  to  the  first  set  is  a  not  uncommon  one  in 
economics;  we  shall  see  its  economic  interpretation  in  a  resource 
allocation  context  later. 

This  process  has  one  difficulty  which  has  some  implications 
for  the  structure  of  the  adjustment  process  in  optimal  resource 
allocation.  It  is  possible  that  for  some  values  of  zr+1,  .  .  .,  zp, 
the  function  f(zv  .  .  .,  zp)  might  not  have  a  maximum  with 
respect  to  zv  .  .  .,  zr\  that  is,  f(zv  .  .  .,  zr,  .  .  .,  zp)  might  increase 
indefinitely  as  one  or  more  of  the  z/s  (i  =  1,  .  .  .,  r)  increase 
to  infinity.  This  is  true  even  if  the  function  f(zv  .  .  .,  z9)  is 
strictly  concave.  To  be  sure,  if  zr+1,  .  .  .,  zp  have  the  values28) 
zr+1,  .  .  .,  zp,  then  the  function  f(zv  .  .  .,  zp)  has  its  maximum 
value  when  zt  =  zi  (i  =  1,  .  .  .,  r),  so  the  maximization  process 
is  well  defined  there.  In  general,  the  problem  does  not  arise  if 
zr+1,  .  .  .,  zp  are  sufficiently  close  to  the  maximizing  values 
zr+1,  .  .  .,  z p,  but  it  might  arise  otherwise. 

We  are  also  implicitly  assuming  that  the  maximum,  when  it 
exists,  is  unique. 

Subject  to  these  qualifications,  however,  the  limiting  form  of  the 
gradient  process  defined  by  (11)  and  either  (10)  or  its  equivalent 
(12)  has  the  same  satisfactory  convergence  properties  as  those 
of  the  gradient  process  (8).  Parallel  to  (9),  we  can  assert, 

27)  I.e.,  there  is  a  set  of  functional  relations  zt  —  zx{zr+1,  .  .  .  zv),  .  .  ., 
zr  —  zr(zr+1,  .  .  .,  zp)  such  that  zx,  .  .  .,  zr  determined  from  these  relations  will 
satisfy   (10). 

28)  We  recall  that  /  is  maximized  at  the  point  z  =  (zv  z2,  .  .  .,  zr,  zr+1,  .  .  .,  zp). 
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(13)  if  f(zlf  .  .  .,  zv)  is  strictly  concave,  then  in  a  region  suf- 
ficiently close  to  the  maximum  so  that  equations  (12)  are 
solvable  throughout,  the  process  defined  by  (11)  and  (12) 
converges  to  the  maximum  point  (zv  .  .  .,  zv). 

5.  Non-negative  Variables. 

The  preceding  discussion  has  assumed  that  the  variables 
zv  .  .  .,  zv  are  unrestricted  in  range.  However,  as  noted  in  part  I, 
section  C,  in  our  resource  allocation  problem,  we  are  primarily 
interested  in  the  case  where  all  variables  are  required  to  be  non- 
negative.  Consider,  for  example,  the  maximization  of  (3).  If 
the  maximizing  value  of  xj  is  positive,  then,  indeed,  the  derivative 
has  to  be  zero.  Suppose  however,  n^Xj)  has  its  maximum  at 
zero;  that  is,  operating  the  process  at  any  positive  level  involves 
a  smaller  profit  (or  greater  loss)  than  not  operating  at  all. 
This  implies  that  at  zero,  the  marginal  profitability  cannot  be 
positive,  but  it  might  be  negative,  as  illustrated  in  Figure  3. 

f(z) 


>■  z 


Fig.  3 


The  same  considerations  hold  when  there  are  several  variables. 
We  may  thus  conclude, 

(14)     Jz  ^  0  for  i  =  1,  .  .  .,  ft;  if  JZi  <  0  for  some  i,  then  zt  =  0. 

Relations  (14)  are  in  general  only  necessary  conditions  for  a 
local  maximum  in  general.  If  the  strict  inequality  never  holds, 
then,  as  we  have  seen  in  paragraph  1  of  this  section,  the  point 
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may  be  a  local  minimum.  Even  if  the  strict  inequality  does  hold 
for  some  components  but  not  for  others,  a  point  satisfying  (14) 
may  be  neither  a  maximum  nor  a  minimum.  Even  if  it  is  a 
local  maximum,  it  need  not,  of  course,  be  a  global  one. 

However,  as  in  paragraph  2  of  this  section,  relations  (14)  are 
a  sufficient  condition  for  a  global  maximum  if  the  function 
f(zv  .  .  .,  zp)  is  concave,  as  illustrated  for  one  variable  in  Figures 
1  and  3. 

The  gradient  method  described  in  (8)  also  requires  some 
modification  when  we  deal  with  non-negative  variables.  If,  at 
some  point  in  the  process,  one  of  the  variables  zi  is  zero  and  if 
at  the  same  point  dfjdzi  <  0,  the  unmodified  gradient  method 
(8)  would  require  zi  to  decrease  further,  that  is  become  negative, 
which  would  make  no  sense  if  the  variable  is  intrinsically  non- 
negative.  This  would  happen,  for  example,  in  Figure  3,  if  the 
process  has  reached  the  point  z  =  0.  To  prevent  this,  we  must 
add  to  (8)  the  rule  that  in  this  case  the  variable  remains  at  zero, 
so  that  its  value  does  not  change,  i.e.,  dzjdt  =  0. 

The  gradient  method  then  takes  the  following  form: 

(15)  ^=(^^  =  °and»<°' 

{  df/dZi  otherwise. 

It  is  easy  to  see  that  in  order  for  the  process  (15)  to  stop,  that  is, 
for  dzjdt  to  equal  zero  for  all  i,  it  is  necessary  and  sufficient 
that  the  conditions  (14)  hold.  In  other  words,  the  maximum 
zv  .  .  .,  zp  is  the  equilibrium  point  of  the  system  of  differential 
equations   (15). 

In  the  limiting  form  of  the  gradient  method,  (14)  is  applicable 
to  the  variables  zv  .  .  .,  zr,  in  view  of  (10)  applied  to  the  case  of 
non-negative  variables.  Similarly,  for  variables  zr+1,  .  .  .,  zp)  (11) 
is  replaced  by  (15).  The  method  becomes,  then, 

(16)  df/dZi  ^  0,  with  dfjdzt  <  0  only  if  zi  =  0,  for  i  =  1,  .  .  .,  r; 

(17)  dz./dt  =  ( °  lf  Zi  =  °  and  df,dZi  <  °' 

I  df/dz4  otherwise, 

for  i  =  r  -f-  1, .  .  .,  j>. 

In  both  forms  of  the  gradient  method,  convergence  is  assured 
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under  the  same  circumstances  as  in  the  case  of  variables  un- 
restricted as  to  sign. 

Finally,  for  future  reference,  note  that  minimization  calls  for 
the  same  rules  as  maximization,  with  appropriate  changes  of 
sign.  Thus,  instead  of  (14),  we  have  for  the  minimum  z, 

(18)     /2.  ^>  0  for  i  =  1,  .  .  .,  <p\  if  /z.  >  0  for  some  i,  then  zt  =  0. 

Again,  (18)  is  a  necessary  condition  for  a  minimum  in  general 
and  a  sufficient  condition  if  f(zlf  ...,zp)is  convex.29)  The  gradient 
method  (15)  applied  to  a  minimum  becomes, 

0  if  zi  =  0  and  df/dz€  >  0, 


(19)  dzjdt  =  , 
1  —dfldzt  otherwise. 

B.  Constrained  Maxima  and  Saddle-points. 

The  numbers  zlt  .  . .,  z9  form  the  constrained  maximum  of 
f(zlt  .  .  .,  zp)  subject  to  the  constraints 

(20)  SA 2,)^0  (j=l,...,s), 

if  f(zv  .  .  .,  zp)  ^  f(zv  .  .  .,  zv)  for  all  combinations  of  values  of 
zlt  .  .  .,  z p  which  satisfy  (20).  Again,  we  may  and  will  restrict 
ourselves  to  non-negative  values.  The  resource  allocation  problem 
of  part  I,  section  C  deals  with  such  a  constrained  maximum; 
here  them's  are  interpreted  to  include  the  x's  and  the  y's,  f(zv...,zp) 
is  identified  with  U(yv  .  .  .,  yn)  (the  fact  that  U  does  not  depend 
upon  all  the  z's  does  not  create  any  difficulty),  and 

m 

(21)  ft(*„.../i,)  =  {« 

2g«(*i)+6i  (»  =  »  +  l,...fs). 

3=1 

When  the  constraints  (20)  are  in  the  form  of  equalities  rather 
than  inequalities  and  the  variables  are  not  restricted  to  be  non- 
negative,  the  classical  method  of  Lagrange  multipliers  supplies 
a  necessary  condition  for  a  constrained  maximum.  That  is, 
in   order   that   zv  .  .  .,  z9   maximize    f(zlt  .  .  .,  z9)    subject    to 

29)  A  convex  function  is  one  whose  negative  is  concave;  that  is,  /  is  convex 
if  —/is  concave. 
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gj{z1}  .  .  .,  zv)  =  0  (j  =  1,  .  .  .,  s),  it  is  necessary  that  there  exist 
flf  .  .  .,  ps  such  that 

(22)  K+thii****       (»=1.---^). 

&&,  . ,  .,  *,)  =  0    (/  ==  1,  .  .  .,  s). 

Here,  g,- >z  =  dg^dz^  This  condition  has  been  much  used  in 
economics.30)  Thus,  the  theory  of  consumer's  behavior  makes 
use  of  a  special  case  of  (22),  where  z/s  are  the  amounts  of 
different  commodities  purchased,  f(zlt  .  .  .,  zp)  is  the  utility 
derived  from  a  given  bundle  of  commodities,  and  there  is  just 
one  constraint,  the  budgetary  constraint,  which  can  be  written 
in  the  form 

where  qi  is  the  price  of  commodity  i  and  M  is  total  income.  In 
this  case,  gjz  =  —  qt\  the  Lagrange  multiplier  px  is  interpreted 
as  the  marginal  utility  of  income. 

Another  application  of  (22),  this  time  using  more  than  one 
constraint,  occurs  in  Lange's  development  of  the  theory  of  welfare 
economics.31) 

Equations  (22)  can  be  rewritten  in  an  interesting  way  if  we 
introduce  the  Lagrangian  L,  defined  as, 

s 

(23)  L(zlt  . . .,  *-,;  plt  . . .,  j>n)=f{zlt  . . .,  z9)  +  2,p,gi{zv  . . .,  *,). 

3=1 

It  is  easy  to  see  that, 


(24)  dLjdz,  =  ft(  +  .J  fiSjt,t ,  dL/dfi,  =  g,{h  . . .,  x,). 

3=1 

In  view  of  (24),  the  condition   (22)  which  is  necessary  for  a 
maximum  is  that, 

(25)  Lz=  0,  L9§  =  0,  i  =  1,  .  .  .,  p,  j  =  1,  .  .  .,  s. 

This  form  is  rather  suggestive.  If  we  take,  for  example,  the  p/s 

30)  See  R.   G.   D.   Allen,   Mathematical  Analysis  for  Economists    (London: 
Macmillan,   1938)  pp.  364 — 83;  Samuelson,  Foundations,  pp.  262 — 64. 

31)  See  O.  Lange,  "The  Foundations  of  Welfare  Economics,"  Econometrica, 
X   (1942),  215—28. 
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as  given  numbers,  with  pj  =  pj  for  each  /,  then  (25)  shows  that 
the  z/s  satisfy  at  least  a  necessary  condition  for  a  maximum  or 
minimum.  Before  following  up  this  hint,  let  us  reconsider  the 
constrained  maximum  problem  in  the  form  first  proposed. 

We  want  to  change  the  problem  which  leads  to  (25)  in  two 
ways:  the  z/s  are  to  be  restricted  to  be  non-negative,  and  the 
constraints  are  to  be  inequalities,  as  given  in  (20).  Our  presenta- 
tion here  is  based  on  the  important  study  by  Harold  W.  Kuhn 
and  Albert  W.  Tucker.32)  Analogous  to  (14),  it  can  be  shown 
that  the  requirement  that  the  z/s  be  non-negative  requires 
modifying  the  first  half  of  (25)  to, 

(26)  LZi  ^  0     (*  =  1,  .  .  .,  f)\   if  LZ{  <  0,  then  z]  =  0. 

Considering  the  constraints  as  inequalities  requires  two 
modifications.  First,  suppose  that,  for  some  /,  gt(zlt  .  .  .,  zv)  >  0. 
Then  this  particular  constraint  is  ineffective,  in  that  the  choices 
of  the  z's  could  have  been  slightly  modified  without  violating 
the  constraints,  but  it  was  not  found  profitable  to  do  so.  Since 
the  solution  to  the  constrained  maximization  problem  should 
be  the  same  if  an  ineffective  constraint  is  dropped  from  the 
problem  completely,  we  would  expect    (correctly)   that, 

(27)  if  gj(zlf...,zv)>0,  then  £  =  0. 

The  second  point  is  that  the  p/s  must  be  non-negative.  The 
point  of  this  can  be  seen  by  considering  a  single  restraint  and 
giving  an  economic  interpretation  to  the  problem.  Let  z1}...,z^ 
be  some  economic  variables  (perhaps  outputs),  f(zlf  .  .  .,  zp) 
the  return  from  them  in  utility  or  money,  and  g(zv  .  .  .,  zv) 
the  excess  supply  of  some  resource,  that  is,  the  initial  amount 
available  of  that  resource  less  the  amount  needed  for  the  choice 
of  variables  zlt  .  .  .,  zp.  The  condition  that  g(zv  .  .  .,  z9)  ^  0 
simply  amounts  to  saying  that  more  cannot  be  used  of  the 
resource  than  is  available.  Assume,  for  simplicity,  that  the  z/s 
are  all  positive,  and  that  the  constraint  is  effective;  then, 

(28)  /gi  +  ^  =  0,  g(zlt  .  .  .,  *,)  =  0. 

32)  See  H.  W.  Kuhn  and  A.  W.  Tucker,  "Nonlinear  Programming,"  in  J. 
Neyman,  ed.,  Proceedings  of  the  Second  Berkeley  Symposium  on  Mathematical 
Statistics  and  Probability  (Berkeley  and  Los  Angeles:  University  of  California. 
Press   (1952),  pp.  481-92. 
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Consider  any  movement  of  the  z/s  which  will  increase  the  return 
f(zv  .  .  .,  zp)  as  compared  with  the  return  when  the  variables 
take  on  the  values  zv  .  .  .,  zp.  For  example,  suppose  a  small 
increase  in  zx  will  increase  f(zv  .  .  .,  zp).  It  must  be,  then,  that 
such  an  increase  is  ruled  out  as  being  infeasible  in  that  it  violates 
the  constraint  g(zv  .  .  .,  zp)  2>  0,  for  if  it  were  feasible,  the 
l/s  could  not  be  the  constrained  maximum.  That  is,  an  increase 
in  zx  will  increase  f(zv  .  .  .,  zp)  but  must  decrease  g(zv  .  .  .,  zp) 
below  zero.  This  requires  that, 

(29)  fH  >  0,  gZi  <  0; 

in  view  of  the  first  part  of  (28),  we  must  have  p  >  0. 

In  economic  terms,  we  can  imagine  that  instead  of  explicitly 
restraining  an  economic  unit  to  undertake  only  feasible  policies, 
we  permit  any  decisions  (whether  feasible  or  not)  on  the  z/s 
but  require  it  to  pay  a  price  ft  on  the  amount  of  the  limited 
resource  used.  We  also  let  it  use  any  part  of  the  initial  supply 
of  the  resource  or  sell  any  part  at  the  price  ft.  Then  the  unit's 
net  return  can  be  thought  of  as  f(zlf  .  .  .,  zp)  +  ft  g{h>  •  •  •>  zv)> 
which  is  the  Lagrangian.  Then  the  above  asserts  that,  by 
choosing  ft  to  have  the  non-negative  value  p,  the  unit  will  be 
constrained  not  to  violate  the  feasibility  condition. 

If  we  combine  (26),  (27),  and  the  preceding  discussion,  with 
the  aid  of  (20)  and  the  second  half  of  (24),  we  have  the  following 
theorem,  due  to  Kuhn  and  Tucker: 

Theorem    1.   A  necessary  condition  that  zv  .  .  .,  zp  be   a 

constrained  local  maximum  of  f(zv  .  .  .,  zp)   subject  to  the 

constraints  g3(%  .  .  .,  zp)  ^  0  (j  =  1,  .  .  .,  s),  with  all  variables 

non-negative,  is  that, 

(a)  zt  ^  0;  LZi  ^  0;  if  LZi  <  0,  then  z,  =  0     (i  =  1,  .  .  .,  ft); 

(b)  fa  ^  0;  LPj  ^  0;  if  LPf  >  0,  then  p,  =  0  tf  =  1,  .  .  .,  s).33) 

33)  Strictly  speaking,  the  above  theorem  is  true  only  if  the  constraint  functions 
gj{zx,  .  .  .,  zv)  satisfy  an  additional  condition,  such  as  that  referred  to  by  Kuhn 
and  Tucker  as  the  Constraint  Qualification  {op.  cit.,  pp.  483 — 4).  A  simple 
condition  of  this  type  (used  by  M.  Slater  is  an  unpublished  paper)  is  that  there 
exist  some  zlt  .  .  .,  z9  such  that  gf(zlt  .  .  .,  zp)  >  0  for  all  /;  in  economic  terms, 
that  it  be  possible  to  choose  the  economic  variables  so  that  there  is  an  excess 
supply  of  all  resources. 
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Remark.  It  is  to  be  stressed  that  Theorem  1  yields  only  a 
necessary  condition  for  a  local  maximum.  The  point  is  precisely 
analogous  to  that  which  has  already  been  made  for  unconstrained 
maxima.  Except  for  the  strict  inequalities  in  condition  (1),  a 
constrained  local  minimum  would  satisfy  Theorem  1 ;  even  with 
the  inequalities,  a  point  satisfying  conditions  (a)  and  (b)  might 
be  neither  a  maximum  nor  a  minimum  locally.  Further,  even  if 
we  have  a  local  maximum  satisfying  the  conditions  of  Theorem  1, 
there  is,  in  general,  no  assurance  that  it  will  be  a  global  maximum. 

Additional  conditions  on  the  functions  involved  are  needed 
to  insure  that  the  conditions  of  Theorem  1  suffice  for  a  global 
maximum.  In  view  of  the  corresponding  results  for  unconstrained 
maxima,  it  is  not  surprising  to  find  that  conditions  (a)  and  (b) 
are  sufficient  for  a  global  maximum  if  the  functions  f(zv  . .  .,  zp) 
and  gj(zlt  .  .  .,  zp)   (j  —  1,  .  .  .,  s)  are  all  concave  functions. 

Closely  related  to  this  problem  is  that  suggested  by  the 
economic  interpretation  given  in  the  discussion  leading  up  to 
Theorem  1,  namely  whether  or  not  the  economic  unit  can  be 
thought  of  as  choosing  zlt  .  .  .,  z9  so  as  to  maximize  the  Lagran- 
gian  when  the  ft/s  are  set  equal  to  plf  .  .  .,ps.  Condition  (a) 
shows  that  a  necessary  condition  for  a  maximum  must  be  satisfied 
(recall  condition  (14)).  It  is  certainly  not  always  true  that  the 
z/s  maximize  the  Lagrangian  given  the  p/s;  in  economic  terms, 
it  is  not  always  true  that  the  optimal  allocation  of  resources  will 
be  achieved  by  maximizing  profits  even  when  the  prices  of 
resources  are  properly  set.34)  But  there  is  an  important  class  of 
cases  when  in  fact  we  can  think  of  maximizing  the  Lagrangian 
with  respect  to  the  z/s.  Indeed,  we  know  from  earlier  discussion 
(section  A.  2,  this  part)  that  condition  (a)  of  Theorem  1  is 
sufficient  for  a  maximum  if  the  Lagrangian  is  a  concave  function 
of  the  z/s  (taking  the  p/s  as  given  at  pv  .  .  .,p8).  In  turn,  we 
can  say  that  this  condition  is  satisfied  if  f(zv  .  .  .,  zv)  and 
gj(zv  .  .  .,  z9)  (j  =  1,  .  .  .,  s)  are  all  concave  functions,  for  then 
L(zv  .  .  .,  zp;  p1}  .  .  .,  ps)  is  a  combination  of  concave  functions 
with  non-negative  coefficients,  and  any  such  combination  is 
again  concave. 

34)  For  an  emphatic  statement  of  this  viewpoint,  see  Samuelson,  Foundations, 
pp.  230—31,  234—35. 
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(30)  If  the  functions  f(zv  .  .  .,  zp)  and  gj[zlf  .  .  .,  zp)  are  concave, 
then  the  maximum  of  L(zv  .  .  .,  zv\  pv  .  .  .,  pn)  is  achieved 
by  setting  zt  =  Mt  (*'  =  1,  .  .  .,  p). 

In  view  of  the  parallehsm  between  conditions  (a)  and  (b) 
of  Theorem  1,  it  is  natural  to  ask  if  the  p/s  minimize  the  Lagran- 
gian  given  that  zi  =  z{  for  all  i.  Here,  this  is  clearly  true,  because 
the  Lagrangian  is  a  linear  function  of  the  p/s  and  all  linear 
functions  are  convex  (as  well  as  concave),  so  the  result  follows 
from  (18). 

(31)  The  minimum  of  L(zlf  .  .  .,  z9\  pv  .  .  .,  ps)  is  achieved  by 
setting  pj=pj  (j  =  1,  .  .  .,  s). 

Statements  (30)  and  (31)  suggest  the  convenience  of  the 
following  definition  as  applied  to  any  function  depending  upon 
two  sets  of  variables  (which  we  will  here  regard  as  non-negative). 
Definition  2.  The  function  &{zlf  .  .  .,  zp;  pv  .  .  .,  pa)  is 
said  to  have  a  saddle-point  at  (zv  .  .  .,  zp;  plt  .  .  .,  ps)  if 
0(zv  .  .  .,zv;  pv  .  .  .,p8)  has  its  maximum  in  zlf  .  .  .,  zp  at 
zv  .  .  .,  z p   and   &(zlt  .  .  .,  zp;    pv  .  .  .,  ps)    has   it   minimum   in 

Pv  •  •  ->Ps  at  Pv  •  •  ->Pn- 

The  concept  of  a  saddle-point  was  used  by  von  Neumann 
and  Morgenstern  in  connection  with  the  theory  of  zero-sum 
two-person  games.35)  The  definition  just  given  is  not  the  most 
general  possible,  but  it  is  sufficient  for  the  present  purposes. 
It  is  useful  to  think  of  a  game  in  which  player  I  chooses  p  real 
numbers,  zv  .  .  .,  zp,  player  II  chooses  s  real  numbers,  pv  .  .  .,  ps, 
the  two  choices  being  made  independently,  and  the  amount  paid 
by  player  II  to  player  I  is  given  by  the  function  0(zv  .  .  .,  zp, 
Pi>  -  -  •»  Ps)-  (^  negative  value  means  that  player  I  pays 
player  II.)  Clearly  player  I  wishes  to  maximize  the  function 
0(zlf  .  .  .,  zp]  pv  .  .  .,  ps)  with  respect  to  the  variables  zv  .  . .,  zv 
at  his  control  for  any  given  choice  of  player  IFs  variables,  while 
player  II  wishes  to  minimize  with  respect  to  his  variables. 

The  discussion  can  be  summarized  in  the  following  theorem 
due  to  Kuhn  and  Tucker: 

35)  J.  von  Neumann  and  O.  Morgenstern,  Theory  of  Games  and  Economic 
Behavior  (1st  ed.;  Princeton,  New  Jersey:  Princeton  University  Press,  1944), 
p.  95. 
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Theorem  2.  If  f(zv  .  .  .,  z9)  and  g4(zlt  .  .  .,  zv)  (j  =  1,  .  .  .,  n) 
are  concave  functions  of  the  non-negative  variables  zlt  .  .  .,  z^,,, 
then  a  necessary  and  sufficient  condition  that  zlf  .  .  .,  ^^ 
maximize  /(^ . . .,  2J  subject  to  the  constraints  gj  (zlf . . ._,  zv)  ^  O 
(/  =  1,  .  .  .,  s)  is  that  there  exist  numbers  pv  .  .  .,  ^s  such: 
that  the  Lagrangian, 

L(zlt  .  .  .,  zp;  plf  .  .  .,  ps)  =  f(zv  .  .  .,  z9)  +  2  ftfofe  •  •  ■»  ^)i- 

i=l 

has  (zv  .  .  .,  f  j,;  p1}  .  .  .,  p8)  as  a  saddle-point  with  all  variables 

being  regarded  as  non-negative.36'37) 

Theorem  2  has  considerable  economic  interest  from  a  static 
point  of  view,  i.e.,  as  a  characterization  of  the  optimal  resource 
allocation.  However,  we  are  here  more  interested  in  its  im- 
plications for  a  process  of  successive  approximations.  In  the 
saddle-point  problem,  the  variables  are  unconstrained  (except 
that  they  must  be  non-negative);  hence,  it  appears  possible 
that  a  gradient  method  will  make  some  sense.  The  equivalence 
of  the  saddle-point  and  constrained  maximum  problems  can  then 
be  used  to  apply  the  resulting  process  to  the  latter  problem. 

C.  Local  Saddle-point  Conditions  for  Constrained  Maxima. 

Before  discussing  the  gradient  method  for  saddle-points,  we 
will  present  some  local  theorems  analogous  to  Theorem  2.  If 
the  concavity  conditions  are  not  satisfied,  we  cannot  expect 
to  be  able  to  state  simple  conditions  which  could  characterize  a 
global  maximum  or  distinguish  it  from  a  local  maximum.  However, 
it  would  at  least  be  worthwhile  to  find  necessary  and  sufficient 
conditions  for  a  local  maximum.  Theorem  1  does  not  satisfy  this 
requirement:  it  only  states  necessary  conditions. 

36 )  Again,  the  above  theorem  is  true  only  if  the  constraint  qualification  holds; 
see  footnote  33. 

37)  The  equivalence  relation  between  saddle-points  and  constrained  maxima 
was  previously  studied  in  the  case  where  the  functions  f(zv  . . .,  zP),  gj{zx,  . . .,  zv) 
are  linear,  that  is,  in  the  case  of  linear  programming,  by  D.  Gale,  H.  W.  Kuhn 
and  A.  W.  Tucker,  "Linear  Programming  and  the  Theory  of  Games,"  ch.  XIX 
in  Activity  Analysis,  pp.  317 — 29,  andG.  B.  Dantzig,  "A  Proof  of  the  Equivalence 
of  the  Programming  Problem  and  the  Game  Problem,"  ch.  XX,  ibid.,  pp. 
330 — 35.  The  Kuhn-Tucker  theorem  given  in  the  text  presents  a  somewhat 
different  form  of  the  relation  between  the  two  types  of  extrema  as  well  as  an 
extension  to  nonlinear  maximand  and  constraints. 
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Theorem  2  suggests  that  a  local  constrained  maximum  might 
be  characterized  by  the  local  saddle-points  of  a  Lagrangian. 
This  is  not  true  if  we  take  the  Lagrangian  in  the  form  in  which 
it  has  been  used,  but  it  is  true  for  modifications  of  the  original 
Lagrangian,  of  which  we  present  two.  First,  we  will  show  by 
an  example  that  a  local  constrained  maximum  is  not  in  general 
a  local  saddle-point  of  the  Lagrangian.  Let  the  maximand  be 
f(zlt  z2)  ==  z\  +  z\  and  the  constraint  g(zv  z2)  =  1  —  z±  —  z2  ^  0. 
It  is  easy  to  see  that  the  maximum,  for  non-negative  variables, 
is  attained  at  two  points,  (0,  1)  and  (1,  0).  Let  us  consider  the 
second.  The  Lagrangian  is, 

(32)  L(zv  z2,  p)=z*  +  4  +  P(l  ~  zi  ~  *2), 

By  Theorem  1,  at  the  maximum,  Lz  =  0  (since  z1=  1  >  0), 
so  that  p  =  2.  If  the  optimal  solution  were  a  local  saddle-point, 
the  point  (1,0)  would  be  a  local  maximum  of, 

(33)  L(zvz2;p)  =  z21  +  zj+2(l-z1-z2)  =  (*x-l)«  +  fe-1)2. 

But  clearly  any  change  in  zx  would  increase  (33),  so  that  the  point 
cannot  be  a  local  maximum  of  (33)  and  therefore  not  a  local 
saddle-point. 

Our  program,  then,  is  to  search  for  a  modification  of  the 
Lagrangian  so  that  a  constrained  maximum  will  correspond  to 
a  saddle-point.  First  we  remark  that  no  difficulty  occurs  with 
respect  to  the  minimization  part  of  Definition  2,  as  we  have 
already  seen  in  equation  (31);  that  is,  the  p/s  minimize  the 
Lagrangian  L(zlf  .  .  .,  zv;  pv  .  .  .,  ps)  with  respect  to  the  p/s 
regardless  of  assumptions  about  the  functions  f(zv  .  .  .,  zp), 
gy(zv  .  .  .,  zv).  This  is  because  the  Lagrangian  is  linear  in  the 
p/s.  Our  concern  therefore  is  to  modify  the  Lagrangian  so  that 
it  be  locally  concave  in  the  z/s  when  the  p/s  are  set  equal  to 
their  equilibrium  values.  More  precisely,  we  shall  seek  to  make 
the  Lagrangian  strictly  concave  locally  in  the  z/s;  the  strict 
concavity  will  be  important  in  the  applications  of  the  gradient 
method  to  be  discussed  in  the  next  section.  We  will  refer  to 
Lagrangians  modified  so  as  to  have  the  desired  concavity 
properties  as  concavified  Lagrangians. 

Our  first  method  for  concavifying  Lagrangian  is  based  on 
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the  remark  that  the  constraints  in  a  given  maximization  problem 
can  be  described  in  different  ways.  This  is  analogous  to  the  well- 
known  proposition  in  consumers'  demand  theory  that  monotone 
transformations  of  the  utility  function  leave  the  demands 
unchanged,  the  point  being  that  the  set  of  variables  which 
maximizes  a  function  also  maximizes  any  increasing  function 
of  it.  In  the  same  way,  it  is  possible  to  transform  the  constraints 
without  changing  the  set  of  values  of  the  variables  which  satisfy 
them.  Let  pt{us)  be  any  (real- valued)  function  of  one  variable 
which  preserves  signs,  that  is,  pj{ut)  is  positive,  negative  or  zero 
according  as  ui  is  positive,  negative,  or  zero,  respectively.  Let 
one  of  the  constraints  in  a  maximization  problem  be  that 
gj{zv  .  .  .,  z p)  ^  0.  Define  a  new  function  of  the  variables 
*i,  •  •  •,  **, 

(34)  gf(zv  .  .  .,  zJPi)  =  pfes{zv  .  .  .,  *,)]. 

In  other  words,  for  any  given  set  of  values  of  the  variables  zi% 
the  corresponding  value  of  gf  is  obtained  by  first  computing 
gj(zv  .  .  .,  zp),  and  then  computing  the  value  of  p^Uj)  when 
Uj  is  set  equal  to  the  value  of  g^z^  .  .  .,  zp).  Since  the  function 
pAui)  preserves  signs,  it  follows  from  (34)  that 

(35)  g*(zv  .  .  .,  zJPj)  ^  0  if  and  only  if  gj(zv  .  .  .,  z9)  ^  0. 

The  constraints  then  are  unchanged  by  the  transformation. 
For  any  sign-preserving  functions  pj(ff;)>  any  values  of  the 
variables  zv  .  .  .,  zp  which  satisfy  the  original  constraints. 
gj(z1,  .  .  .,  zp)  ^  0  also  satisfy  the  transformed  constraints, 
g*  (zv  .  .  .,  zJpj)  ^  0,  and  conversely  therefore  in  particular 
the  values  zlt  .  .  .,  zv  which  maximize  f(zlf  .  .  .,  zv)  under  the: 
original  constraints  are  the  same  as  those  which  maximize  it 
under  the  transformed  constraints.  For  the  transformed  problem, 
the  Lagrangian  becomes 

(36)  L*{z1,...,zp;t1,...,tJP) 

S 

=  f{*i,  •  •  ./*,)  +  2  pjg?(h>  •  •  •>  zJpi)> 

The  necessary  conditions  of  Theorem  1  must  hold  for  all  the* 
Lagrangians  formed  by  transforming  the  constraints. 
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The  problem  of  concavification  thus  becomes  one  of  choosing 
among  the  Lagrangians  with  transformed  constraints  a  class  of 
Lagrangians  for  which  the  concavity  property  holds.  We  want 
however  a  class  which  can  be  defined  rather  broadly  in  advance 
of  solving  the  problem;  there  is  no  point  in  merely  showing  the 
existence  of  a  set  of  functions  p^u/j  for  which  the  Lagrangian 

(36)  is  locally  strictly  concave  in  the  z/s  when  pi  =  pj  for  all 
/  if  the  determination  of  that  set  is  as  hard  as  the  original 
maximization  problem.  Fortunately,  it  is  possible  to  specify  a 
whole  class  of  transforming  functions  /o,(%)  which  depend  only 
on  a  very  general  knowledge  of  the  functions  involved. 

We  will  present  the  result  in  terms  of  a  very  specific  type  of 
transforming  function.  Note  that  since  Pj(us)  is  negative  for 
us  negative  and  positive  for  ui  positive,  it  must  be  increasing 
at  Uj  =  0.  Let  us  consider  the  class  of  functions, 

(37)  Pi(ui)  =  l-{l-uiY+ii, 

with  7]i  an  even  integer  which  has  this  property.  The  following 
theorem  can  be  demonstrated: 

Theorem  3.  Under  certain  regularity  conditions,  for  all  even 
r]/s  sufficiently  large,  a  necessary  and  sufficient  condition  that 
zv  .  .  .,  zp  be  a  local  maximum  of  f(z1,  .  .  .,  zp)  subject  to  the 
constraints  gj(zv  .  .  .,  zp)  ^  0  is  that  there  exist  numbers 
pv  .  .  .,  p8  such  that  the  Lagrangian, 

£*(*!,...,*,;  Pv;psM 

=  f(h> .  ■ ,  *,)  +  2  Pi  (i  -  [i  -  g,(*i,  •  • ,  *;}j1+"). 

3=1 

has  (zv  .  .  .,  z p;  pv  .  .  .,  ps)  as  a  local  saddle-point,  all  variables 
being  considered  as  non-negative.  In  particular,  the  function 
L*(zv  .  .  .,  zp;  pv  .  .  .,  pjrj)    is    locally   strictly   concave   in 

*v  •  •  -,  z9.  38) 

Notice  that  the  transformation  can  be  chosen  from  among  a 
wide  class  since  all  rj/s  sufficiently  large  will  suffice.  The  particu- 

38 )  For  a  demonstration  and  more  complete  spelling  out  of  the  regularity 
conditions,  see  K.  Arrow  and  L.  Hurwicz,  "Reduction  of  Constrained  Maxima 
to  Saddle-point  Problems,"  in  J.  Neyman,  ed.,  Proceedings  of  the  Third  Berkeley 
Symposium  on  Mathematical  Statistics  and  Probability  (Berkeley  and  Los 
Angeles:  University  of  California  Press,  1957),  V,  1 — 20. 
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lar  choice  of  transformation  function  (37)  is  not  essential;  the 
basic  condition  is  that  the  ratio, 

ft"(0)/ft'(0), 

be  sufficiently  large.    Another  example  is  pj(Uj)  =  1  —  erW. 

We  will  refer  to  the  Lagrangian  of  Theorem  3  as  the  concavified 
Lagrangian  with  transformed  constraints.  It  is  in  fact  a  Lagrangian 
in  the  ordinary  sense  applied  to  a  problem  which  is  identical 
with  the  original  but  has  the  constraints  restated. 

Before  leaving  this  method  of  concavification,  we  note  that 
it  can  be  applied  to  the  situation  where  the  maximand  f(zv . . .,  zv) 
and  the  constraints  gj(zt,  .  .  .,  z9)  are  linear.  In  this  case,  Theo- 
rem 2  is  applicable,  but  the  Lagrangian  is  linear  in  the  z/s, 
not  strictly  concave.  The  latter  property,  as  we  have  remarked, 
is  desirable  in  applications  of  the  gradient  method  to  be  discussed 
in  the  next  section.  The  class  of  transformation  functions  which 
will  insure  something  close  to  strict  concavity  of  the  Lagranian 
is  very  wide  indeed,  including  all  strictly  increasing  strictly 
concave  functions.  An  application  to  the  resource  allocation 
problem  in  the  linear  ease  will  be  made  in  section  G,  part  IV. 

We  now  proceed  to  an  alternative  method  of  achieving  a 
concave  Lagrangian,  based  on  a  lemma  due  to  Debreu.39)  In 
this  case,  the  Lagrangian  is  modified  by  subtracting  a  quadratic 
function  of  the  constraints;  we  will  therefore  refer  to  it  as  the 
concavified  Lagrangian  with  quadratic  modification.  First,  con- 
sider the  case  in  which  the  constraints  are  equalities  rather  than 
inequalities,  that  is,  the  problem  is  to  maximize  f(zv  .  .  .,  zp) 
subject  to  the  constraints  gj(zv  .  .  .,  z9)  =  0.  Then  consider  the 
expression, 

(38)     L1(Mv..'vx,;pv...,f>JX) 

s  s 

=  /(*i,  •  •  .,  *,)  +  2>j&(*i»  •  •  •>  *p)  -*1  lSi(zv  •  •  •>  **)!*> 

3=1  3=1 

which  differs  from  the  usual  Lagrangian  only  in  its  last  term. 
It  is  easy  to  see  that  the  first-order  conditions  for  a  constrained 
maximum  are  the  same  for  the  Lagrangian  with   quadratic 

39)  G.  Debreu,  "Definite  and  Semidefinite  Quadratic  Forms,"  Econometrica, 
Vol.  20   (1952),  296. 
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modification  as  for  the  usual  Lagrangian.  But  it  can  also  be 
shown,  that,40) 

(39)     L1(zv  .  .  .,  zp;    pv  .  .  .,pJX)    is    locally    strictly    concave 
in  xv  .  .  .,  z p  for  all  X  sufficiently  large. 

The  case  of  constraints  which  are  inequalities  can  now  be 
covered  by  introducing  new  non-negative  variables  wv  .  .  .,  ws, 
and  observing  that  the  condition  g§{zlt  .  .  .,  zp)  ^  0  is  equivalent 
to  the  condition  g3{zlf  .  .  .,  zp)  —  Wj  =  0.  The  constraints  now 
being  written  in  the  form  of  equalities,  we  can  apply  (39), 
provided  it  is  observed  that  the  set  of  variables  with  respect  to 
which  maximization  is  carried  out  has  been  enlarged  by  the 
addition  of  the  w/s. 

Theorem  4.  Under  certain  regularity  conditions,  for  all  X 
sufficiently  large,  a  necessary  and  sufficient  condition  that 
zlt  .  .  .,  zP  be  a  local  maximum  of  f(zv  .  .  .,  zp)  subject  to  the 
constraints  gi(zv  .  .  .,  zv)  ^  0  if  there  exist  pv  .  .  .,ps  and 
wv  .  .  .,  ws  (these  numbers  are  the  same  for  all  X),  such  that 
the  Lagrangian  with  quadratic  modification, 

H(zlf  .  .  .,  zp,  wv  .  .  .,  ws;  plt  .  .  .,  pJX) 

s  s 

=  L{zlf  .  .  .,  zp\  pv  . . .,  ps)  -  2  pjWj  -  X2[gj(zv  •  •  •>  zp)  ~  w*]a. 

3=1  j=l 

have    (zv  .  .  .,  z p,   wv  .  .  .,  ws\  pv  .  .  .,ps)   as   a  local  saddle- 
point,    all   variables   being   considered   as   non-negative.    In 
particular  the  function  lS(zv  .  .  .,  zp,  wv  .  .  .,  ws;  plt  .  .  .,  pJX) 
is  locally  strictly  concave  in  zv  .  .  .,  zp,  wlt  .  .  .,  ws. 
Remark.  When  the  constraints  are  equalities,  the  equilibrium 
values  of  the  Lagrange  multipliers  pj  may  be  positive  or  negative, 
and  the  condition  gj{pv  .  .  .,  ps)  —  0  is  the  same  as  the  condition 
that  Lp  =  0.  Since  the  Lagrangian  is  linear  in  the  p/s,  this 
means  that  for  equilibrium  values  of  the  z4's,  it  is  a  constant 
with  respect  to  the  p/s.  In  particular,  one  can  speak  of  it  as 
taking  a  minimum  with  respect  to  the  p/s  when  pj  =  p^  here 

40)  See  K.  Arrow  and  R.  M.  Solow,  "The  Gradient  Method  for  Constrained 
Maxima  Under  Weakened  Conditions,"  ch.  10  in  K.  Arrow,  L.  Hurwicz,  and 
H.  Uzawa,  Studies  in  Linear  and  Nonlinear  Programming  (Stanford,  California: 
Stanford  University  Press,  1958),  especially  section  4.  The  result  is  essentially 
an  extension  of  Debreu's  lemma  to  cover  the  case  of  non-nesrative  variables. 
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the  range  of  variation  of  the  p/s  is  unrestricted  as  to  sign. 
The  same  statements  hold  for  the  Lagrangian  with  quadratic 
modification,  since  the  p/s  do  not  enter  the  additional  quadratic 
term.  It  follows  that  in  the  saddle-point  statement  of  Theorem  4, 
one  could  properly  say  that  the  p/s  should  be  unrestricted  as  to 
sign,  instead  of  being  non-negative.  However,  it  can  be  proved 
that  when  the  constraints  are  introduced  in  the  special  form 
used  there,  with  a  slack  variable  wi  to  transform  inequalities 
to  equalities,  a  saddle-point  with  respect  to  non-negative  values 
of  the  p/s  is  also  a  saddle-point  with  respect  to  unrestricted 
variations. 

D.  The  Gradient  Method  for  Saddle-points. 

Let  us  imagine  that  a  game  with  a  saddle-point  is  played 
frequently,  in  fact  continuously.  At  any  given  trial,  player  I 
knows  what  player  II  has  chosen  the  previous  time.  Suppose 
player  I  assumes  that  player  II  will  make  the  same  choice  of 
his  variable  plf  .  .  .,ps  on  the  given  trial.  Then  from  his  point 
of  view,  the  problem  is  one  of  unconstrained  maximization.  He 
may  then  be  thought  of  as  starting  a  gradient  process  to  achieve 
this  maximum,  so  that  this  behavior  is  described  by  applying 
(15)  to  the  function  &{zlt  . . .,  zP;  pv  . . .,  ps),  with  the  variables 
plt  .  .  .,ps  regarded  as  given.  Thus, 

v     }  ll  \d&ldzt  otherwise. 

Actually,  however,  the  p/s  are  also  varying.  If  player  II  is 
thinking  along  the  same  lines  as  player  I,  he  will  seek  to  minimize 
the  payoff  function  with  respect  to  plt .  .  .,  ps,  taking  zls .  .  .,  z9 
as  given.  Equation  (19)  is  applicable  here,  so  that, 

fO  if  pj  =  0  and  d&jdp,  >  0, 


(41)  dpJdt 

v     '  ^  {  —  d0/dpj   otherwise. 

We  wish  to  state  conditions  for  the  convergence  of  the  gradient 
method  defined  by  equations  (40)  and  (41).  We  cannot  simply 
apply  the  earlier  statements  about  the  convergence  of  the  gradient 
process  for  an  unconstrained  maximum.  Indeed,  if  &(zlf  .  .  .,  zv\ 
Pi>  -  •  •>  i>s)  is  strictly  concave  in  the  variables  zv  .  .  .,  zv,  taking 
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pv  .  ,  .,  ps  as  given,  then  the  process  (40)  would  converge  if  the 
p/s  remained  constant.  But  in  fact  the  latter  variables  are 
simultaneously  changing  as  a  result  of  (41).  We  may  think  of 
an  anti-aircraft  gun  shooting  at  an  airplane.  After  each  shot, 
the  gunner  may  move  the  gun  so  as  to  correct  for  the  observed 
error,  but  meanwhile  the  airplane  is  moving  away  from  its 
previous  position  and  in  such  a  way  as  to  make  the  error  as  big 
as  possible.  Despite  the  complexity  of  the  problem,  it  is  possible 
to  show  the  following  result, 

(42)  if  &(zv  .  .  .,  zp;  pv  .  .  .,  ps)  is  strictly  concave  in  zx,  .  . '.,  zv 
for  each  set  of  values  of  pv  .  .  .,  ps  and  convex  in  pv  .  .  .,  ps 
for  each  set  of  values  of  zv  .  .  .,  zp)  then  the  gradient 
process  described  by  equations  (40)  and  (41)  converges 
to  a  saddle-point  (zv  .  .  .,  z 9\  pv  .  .  .,  ps)  of  0{zx,  .  .  .,  zv; 

Pi,:,:,Ps).*1'*2) 

That  somewhat  stronger  conditions  than  the  simple  concavity 
assumptions  of  Theorem  2  are  needed  for  convergence  of  the 
gradient  method  can  be  seen  from  the  following  example.43) 
Suppose  there  is  just  one  z  and  one  p,  and, 

0{Z\    p)   =  Z  +  p  —  pZ. 

It  is  easy  to  verify  that  the  unique  saddle-point  is  z  =  I,  p  =  I. 
If  we  disregard  for  the  moment  the  non-negativity  conditions, 
the  gradient  method  becomes, 

41 )  The  assumptions  of  (42)  insure  that  there  is  a  saddle-point  and  that 
the  z- values  must  be  uniquely  defined;  it  is  possible  however  to  have  more  than 
one  set  of  ^-values. 

42 )  The  fact  that  the  z- values  converge  to  the  z- values  of  a  saddle-point  has  been 
proved  with  local  assumptions  and  conclusions  by  the  authors;  see  K.  J.  Arrow 
and  L.  Hurwicz,  "The  Gradient  Method  for  Concave  Programming  I:  Local 
Results,"  ch.  6  in  Arrow,  Hurwicz,  and  Uzawa,  op.  cit.  The  corresponding 
theorem  in  the  large  was  established  by  H.  Uzawa,  "The  Gradient  Method  for 
Concave  Programming  II:  Global  Results,"  ch.  7,  ibid.  That  the  ^-values  also 
converge  is  shown  in  K.  J.  Arrow  and  L.  Hurwicz,  "The  Gradient  Method  for 
Concave  Programming  III:  Further  Global  Results  with  Applications  to 
Resource  Allocation,"  ch.  8,  ibid,  referred  to  below  as,  "Gradient  Method 
for  Resource  Allocation,"  see  section  1.  For  an  earlier  exposition  of  the  gradient 
method  as  applied  to  constrained  maxima  and  to  saddle-points,  see  K.  J. 
Arrow  and  L.  Hurwicz,  "Gradient  Methods  for  Constrained  Maxima,"  Opera- 
tions Research,  Vol.  5   (1957),  258—65. 

43)  The  non-convergence  of  the  gradient  method  in  the  linear  case  was 
observed  by  Samuelson;  see  footnote  22. 
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dzjdt  =  1  —  ft,  dft/dt  =  z  —  1. 
The  solution  of  this  system  of  differential  equations  is, 

z  =  1  +  (*o  ~~  1)  cos  £  +  (1  —  #o)  sm  *> 

^>  =  1  +  {po  —  1)  cos  t  +  (20  —  1)  sin  £,  # 

where  (£0,  ^>0)  is  the  initial  point  of  the  adjustment  path.  It  is 
easy  to  see  that  if  z0  and  ft0  are  sufficiently  close  to  1,  the  solution 
never  becomes  negative,  so  that  the  above  is  in  fact  the  solution 
to  the  gradient  method  of  (40)  and  (41)  with  non-negativity 
conditions  satisfied.  Also  it  is  clear  that  the  solution  is  a  periodic 
function  and  so  never  converges  to  the  saddle-point.  Instead  it 
cycles  endlessly,  neither  diverging  explosively  not  converging.44) 

E.  The  Gradient  Method  for  Constrained  Maxima. 

We  are  now  in  a  position  to  combine  the  results  of  sections  B 
and  D.  In  section  B,  it  was  stated  that  a  constrained  maximum 
problem  was  equivalent  to  a  suitable  saddle-point  problem.  In 
section  D,  it  was  remarked  that,  under  certain  condition,  a 
saddle-point  could  be  determined  by  a  gradient  process.  We  have 
merely  to  apply  (42)  to  the  Lagrangian  of  Theorem  2. 

First,  when  are  the  hypotheses  of  (42)  satisfied  by  the  La- 
grangian? The  Lagrangian  L(zv  . .  .,  zp\  ftv  .  .  ..,  fts)  is  linear  in 
pl9  .  .  .,  fts  for  any  given  set  of  z's  and  hence  certainly  convex. 
Suppose  that  f(zv  .  .  .,  zp)  is  strictly  concave  and  the  g/s  all 
concave.  Then  for  any  given  set  of  non-negative  ft/s,  the  La- 
grangian is  a  sum  of  concave  functions,  one  of  which  is  strictly 
concave;  hence  the  Lagrangian  is  strictly  concave.  Thus,  the 
Lagrangian  satisfies  the  hypotheses  of  (42)  if  f(zv  .  .  .,  zp)  is 
strictly  concave  and  g4{zv  .  .  .,  zv)  is  concave  for  each  /. 

To  describe  the  gradient  method,  we  must  substitute  the  La- 
grangian L(zv  .  .  .,  z9;  ftv  .  .  .,  fts)   for  0(zv  .  .  .,  *,;  ftv  .  .  .,  fts) 
n   (40)  and  (41).  We  can  then  assert, 

Theorem   5.     If    f(zv  .  .  .,  zp)    is    strictly     concave     and 

gj{zv  .  .  .,  zP)  is  concave  for  each  /,  then  the  gradient  process 

defined  by, 

44)  This  fact  was  one  of  the  reasons  for  interest  in  modified  Lagrangians 
where  convergence,  rather  than  endless  cycling,  could  be  obtained. 
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m     ^=foif^=0and/-+l^<0 

l/.,  +  2&&..,  otherwise, 

3=1 

(44  d^/d*  = 

^—  &(*i»  •  •  ->  zv)  otherwise, 

converges  to  a  limit  (zlf  .  .  .,  zv\  pv  .  .  .,  ps),  where 
(zv  .  .  .,  zp)  is  the  constrained  maximum  of  f(zv  .  .  .,  z9) 
subject  to  the  restraints  gj(zv  .  .  .,  zp)  ^  0,  where  the  variables 
zt  are  non-negative. 

The  gradient  method  of  Theorem  5  can  be  given  an  economic 
interpretation  along  the  lines  used  in  section  B.  If  f{zv  .  .  .,  zp) 
is  considered  to  be  the  utility  derived  from  setting  some  variables 
at  levels  zv  .  .  .,  zp,  and  gj(zv  .  .  .,  zp)  is  the  excess  supply  of 
resource  /,  then  pi  can  be  regarded  as  the  price  of  resource  /. 
Since  gu  is  the  increase  in  excess  supply  (decrease  in  excess 
demand)  due  to  a  unit  increase  in  zit  we  can  interpret, 

n 
j=l 

as  the  marginal  cost  attributable  to  a  unit  increase  in  z€.  Hence, 
the  expression, 

n 

/«!  +  2  &&,!,' 

3=1 

can  be  interpreted  as  the  difference  between  marginal  utility 
and  marginal  cost  attributable  to  a  unit  increase  in  zt.  Hence 
(43)  is  an  instruction  to  increase  zt  if  the  marginal  utility  exceeds 
the  marginal  cost  and  decrease  it  otherwise,  with  the  proviso 
that  the  activity  level  zt  cannot  be  decreased  below  zero.  Equa- 
tion (44)  has  even  a  simpler  interpretation,  to  increase  price  if 
the  excess  supply  is  negative  (i.e.,  if  demand  exceeds  supply) 
and  decrease  it  otherwise,  but  not  below  zero.  The  theorem 
then  asserts  that  the  process  just  described  will  eventually 
converge  to  the  activity  levels  which  yield  the  highest  utility 
subject  to  the  conditions  that  demand  never  exceed  supply 
for  any  commodity.  The  process  will  also  define  a  corresponding 
set  of  equilibrium  prices. 
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Theorem  5  is  not  in  present  form  applicable  to  the  resource 
allocation  model  of  part  I,  section  C  even  under  the  strongest 
reasonable  concavity  assumptions.  Somewhat  stronger  theorems 
are  presented  in  part  III,  section  D. 

The  proposition  contained  in  (42)  can  be  applied  locally  to 
the  concavified  Lagrangians  introduced  in  section  C.  Theorems  3 
and  4  then  yield  the  following  corresponding  results. 

Theorem  6.  If  zv  .  .  .,  zp  is  a  local  maximum  of  f(zlt  .  .  .,  zp) 
subject  to  the  constraints  gj(zv  .  .  .,  zp)  ^  0  and  certain 
regularity  conditions  are  satisfied,  then  the  gradient  method 
applied  to  the  concavified  Lagrangian  with  transformed 
constraints  L*(zlt  .  .  .,  zp;  plf  .  .  .,  pjrj)  with  rj/s  sufficiently 
large,  converges  to  a  saddle-point  (zv  .  .  .,  z p;  p^'.  .  .,  ps) 
if  the  initial  approximation  is  sufficiently  close. 

Theorem  7.  If  zv  .  .  .,  zp  is  a  local  maximum  of  f(zv  .  .  .,  zp) 
subject  to  the  constraints  gt(zlt  .  .  .,  zp)  ^  0  and  certain 
regularity  conditions  are  satisfied,  then  the  gradient  method 
applied  to  the  concavified  Lagrangian  with  quadratic  modi- 
fication, L*(zv  .  .  .,  zp,  wlt  .  .  .,  ws;  pv  .  .  .,  pJX)  with  X  suf- 
ficiently large,  converges  to  a  saddle-point  (zlt  .  .  .,  zp, 
wv  .  .  .,  ws;  plf  .  .  .,  ps)  if  the  initial  approximation  is  suf- 
ficiently close. 

F.  The  Price- Adjustment  Method. 

As  in  section  A.  4,  we  may  consider  a  modification  of  the 
gradient  method  of  the  previous  section  in  which  some  of  the 
variables  are  assumed  to  adjust  inifinitely  rapidly,  or,  in  other 
words,  to  be  at  all  points  in  the  adjustment  process  at  an  optimal 
position  given  the  values  of  the  other  variables.  For  the  gradient 
method  described  by  (43 — 44),  we  consider  the  variant  where  the 
z/s  are  supposed  to  maximize  the  Lagrangian  for  any  given  set 
of  p/s,  while  the  p/s  continue  to  be  adjusted  in  accordance  with 
(44).  In  symbols,   (43)  is  replaced  by, 

s 

(45)     zll  .  .  .,  zp  maximize  f(zv  .  .  .,  zp)  +  £&&(*i»  .  •  .,  zv) 

3=1 
for  given  plt  .  .  .,  p8. 

(44)   and   (45)  together  constitute  a  dynamic  system.  This 

system  is  closer  to  the  usual  supply-and-demand  model  than 
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the  system  of  the  preceding  section.  In  the  present  system,  for 
any  given  set  of  prices,  the  activity  levels  optimal  for  that  set 
of  prices  are  determined.  These  determine,  in  turn,  the  supply 
and  demand  of  each  commodity,  and  the  prices  then  move  in  a 
direction  determined  by  the  difference  of  supply  and  demand. 
In  the  system  of  the  last  section,  the  activity  levels  were  not  at 
an  optimal  level  but  only  being  varied  so  as  to  increase  the 
difference  between  utility  and  costs. 

As  noted  in  section  A. 4,  any  system  which  depends  upon 
instantaneous  optimization  with  respect  to  some  variables 
runs  the  risk  of  being  undefined.  At  any  point  in  the  adjustment 
process  where  the  prices  still  differ  from  equilibrium,  some  of 
the  optimal  activity  levels  might  be  infinite.  With  this  qualifi- 
cation, however,  the  price-adjustment  process  defined  by  (44) 
and  (45)  converges  satisfactorily. 

Theorem  8.  If  f(zv  . .  .,  z-9)  is  strictly  concave  and 
gj(zlf  .  .  .,  zp)  is  concave  for  each  /,  then  the  price-adjustment 
process  defined  by  (44)  and  (45)  converges  in  a  region  suf- 
ficiently close  to  equilibrium  so  that  (45)  is  always  well-defined. 

III.   CONDITIONS  FOR  VALIDITY  OF  THE  NATURAL 
MARKET  MECHANISMS 

In  this  part,  we  will  state  the  assumptions  about  the  resource 
allocation  model  of  part  I,  section  C,  for  which  the  global  static 
and  dynamic  characterizations  of  an  optimum  given  in  Theorem 
2,  5,  and  8  of  part  II  can  be  applied  in  a  straightforward  fashion. 
The  resulting  criteria  have  natural  economic  interpretations. 
For  the  static  characterization,  the  assumptions  needed  are  that 
all  the  functions  involved  are  concave;  for  the  determination 
of  dynamic  processes  which  converge  to  an  optimum,  somewhat 
stronger  conditions  are  needed. 

A.  Static  Characterization  of  the  Optimal  Allocation. 

We  will  first  make  the  following  two  assumptions  about  the  utili- 
ty sector,  respectively,  in  the  resource  allocation  model  of  Part.  I 
section  C. 

(U—C)  The  utility  function  U(yv  . . .,  yn)  is  a  concave  function. 
(P  —  C)     The  functions  gu(Xj)  are  concave  functions. 
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Assumption  (P  —  C)  is  a  straightforward  statement  of  non- 
increasing  returns  to  scale.  The  status  of  assumption  (U  —  C) 
is  slightly  more  complicated.  The  ordinal  point  of  view  would 
deny  meaning  to  any  statement  about  a  utility  function  which 
is  not  invariant  under  monotonic  transformation.  Concavity  is 
not  a  property  with  the  desired  in  variance;  indeed,  it  implies 
diminishing  (more  strictly,  non-increasing)  marginal  utility  for 
each  commodity,  a  well-known  shibboleth  for  distinguishing 
the  cardinalists  from  the  ordinalists. 

The  usual  ordinalist  assumption  is  that  the  indifference  surfaces 
are  convex  to  the  origin.  An  alternative  formulation  can  be 
made  in  terms  of  the  following  definitions: 

Definition  3.  A  function  f(zlf  .  .  .,  zp)  is  said  to  be  quasi- 
concave  if  for  any  two  distinct  points  (zlf  .  .  .,  zp)  and 
(z[,  .  .  .,  z'v)  and  any  third  point  (z",  .  .  .,  z'p)  such  that 
*'/  £±=  dZi  +  (1  -  6)z't  (i  =  1,  .  .  .,  f>),  where  0  <  d  <  1, 
f(z±t . . .,  z'J)  is  at  least  as  great  as  the  smaller  of  the  two  num- 
bers f(zv  .  .  .,  z p)  and  f(z'lt  .  .  .,  zp).  The  function  is  said  to 
be  strictly  quasi-concave  if  the  strict  inequality  holds  in  the 
last  statement. 

For  the  moment,  we  will  not  use  the  last  part  of  the  definition. 
It  is  easy  to  see  geometrically  that  an  indifference  map  which 
possesses  a  quasi-concave  utility  indicator  indeed  statisfies  the 
usual  convexity  assumptions.45)  This  property  further  is  in- 
variant under  monotone  transformations  of  /. 

Now  it  has  been  shown  that  under  certain  weak  regularity 
conditions,  there  exists  for  any  quasi-concave  function  a  mono- 
tone transform  which  is  concave.46)  Such  a  function  can,  of 

45)  Despite  the  importance  of  this  assumption  in  consumer's  demand  theory, 
little  attention  has  been  given  to  its  justification.  For  an  argument  due  to 
T.  C.  Koopmans,  see  K.  J.  Arrow,  "An  Extension  of  the  Basic  Theorems  of 
Classical  Welfare  Economics,"  in  Proceedings  of  the  Second  Berkeley  Symposium, 
pp.  529 — 30;  T.  C.  Koopmans,  Three  Essays  on  the  State  of  Economic  Science 
(New  York,  Toronto,  and  London:  McGraw-Hill,  1957),  pp.  26—28.  See  also 
W.  M.  Gorman,  "Convex  Indifference  Curves  and  Diminishing  Marginal 
Utility,"  Journal  of  Political  Economy,  LXV  (1957),  40 — 50. 

46)  See  W.  Fenchel,  Convex  Cones,  Sets,  and  Functions  (Department  of 
Mathematics,  Princeton  University,  1953)  (mimeographed),  pp.  115 — 37; 
B.  de  Finetti,  "Sulle  stratificazaioni  convesse,"  Annali  di  Matematica  Pur  a 
e  Applicata,  vol.  30  (1949),  173—83. 
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course,  equally  well  serve  as  a  utility  indicator.  The  significance 
of  (U  —  C),  then,  is  that  among  the  infinitely  many  utility 
functions  which  represent  the  indifference  map  of  the  economic 
unit,  we  can  choose  (at  least)  one  which  is  concave. 

The  problem  is  to  characterize  the  maximization  of  the  utility 
function,  subject,  to  the  constraints  (2),  with  respect  to  the 
variables  ylf  .  .  .,  yn,  xv  .  .  .,  xm.  It  is  easy  to  verify  from  Defini- 
tion 1  that  a  function  which  is  concave  with  respect  to  some 
members  of  a  set  of  variables  and  where  the  other  variables  do 
not  enter  at  all  is  concave  with  respect  to  all  of  them.  From 
(U  —  C),  then,  U(yv  .  .  .,  yn)  can  be  regarded  as  a  concave 
function  of  all  the  variables  ylt  .  .  .,  yn,  xlt  .  .  .,  xm.  Similarly, 
gijixj)  are  concave  functions  of  the  same  variables.  The  function 
— yif  is  a  linear  and  therefore  concave  function  of  all  these 
variables.  If  we  let 

m 

(46)  gttov  •  •  •>  yn>  xv  •  •  •>  xm)  =  j,zu(xi)—Vi+&i  (*  =  l»  •  •  •'  n)> 

3=1 

m 

(47)  gifov  •  •  •»  yn>  %i>  •  •  •> »«)  =  ?,giifa)+£i  (*'=•»■+ 1, .. .,  s) 

then  each  of  the  functions  gt  is  a  sum  of  concave  functions  and 
therefore  itself  concave.  (Note  that  we  write  gt  as  a  function 
of  all  the  ^/-variables,  though  in  fact  g^  depends  only  on  yt  if 
i^-i^n  and  does  not  depend  on  any  of  the  y- variables  if 
i  >  n.) 

With  the  aid  of  (46)  and  (47),  (2.1)  and  (2.2)  can  be  rewritten 

(48)  gitol,  •  •  •>  Vn>    *V  •  •  •>  *m)   ^  0       (*  =  1,  .  .  .,  S). 

The  function  gt  is  the  excess  of  supply  (both  produced  and  natural) 
over  demand  (both  final  and  interindustry).  It  will  be  referred 
to  as  the  excess  supply. 

The  problem  of  maximizing  U(yv  .  .  .,  yn)  subject  to  the 
constraints  (48)  now  satisfies  the  conditions  of  Theorem  1;  the 
variables  yv  .  . .,  yn,  xlt  .  .  .,  xm  correspond  to  zv  .  .  .,  zp,  the 
function  U(yv  .  .  .,  yn)  to  f(zv  .  .  .,  zv),  and  the  functions  gt 
to  the  functions  gjt  It  can  therefore  be  asserted  that: 

(49)  a  necessary  and  sufficient  condition  that  yx,  .  .  .,  yn, 
xv  .  . .,  xm  maximize  U(yv  .  .  .,  yn)  subject  to  the  constraints 
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(48)  is  that  there  exist  numbers  plf  .  .  .,  ps  such  that  the  La- 
grangian 

8 

=  U(yv  .  .  .,  yn)  +  J  PigdVv  •  •  -,  yn>  xi>  •  •  •>  *m)> 

has  (ylt  .  .  .,  yn,  xx,  .  .  .,  xm;  pv  .  .  .,  fis)  as  a  saddle-point.  (Here, 
^.  is  the  Lagrange  multiplier  associated  with  the  constraint 

To  simplify  notation,  we  shall  write  y  for  (ylt  .  .  .,  yn)  and 
similarly  with  the  other  sets  of  variables. 

From  its  definition,  the  saddle-point  criterion  contains  two 
statements: 

(50.1)  the  function  L(y,  x\  p)  (i.e.,  the  Lagrangian  with  the 
multipliers  replaced  by  their  saddle-point  values)  attains  its 
maximum  as  a  function  of  y  and  x  when47)  y  =  y  and  x  =  x; 

(50.2)  the  function  L(y,  x;  p)  attains  its  minimum  at  p  =  p. 
We  will  now  give  some  economic  interpretation  to  the  state- 
ments (50)  by  considering  them  in  more  detail.  First,  consider 
(50.2);  the  interpretation  has  already  appeared  in  the  discussion 
leading  up  to  Theorem  2.  The  function  L(y,  x;  p)  is  linear  in  p. 
A  mechanical  application  of  (18)  in  part  II,  section  A. 5,  leads 
to  the  following  results: 

(51)  gAy,  x)  ^  o 

(52)  if  gi(y,  x)  >  0,  then  p{  =  0. 

(51)  is  a  repetition  of  the  feasibility  conditions   (48)   or   (2). 

(52)  adds  the  condition  that  for  any  commodity  for  which  supply 
exceeds  demand  at  the  optimum  the  associated  Lagrange  multi- 
plier is  zero. 

To  draw  the  implications  of  (50.1),  we  will  replace  in  the 
Lagrangian  gi  by  its  definition  as  given  in  (46)  and  (47). 


47)  y  =  y  is  an  abbreviation  for  the  set  of  equalities  yx  =  yv  y2  —  y, 
Vn  =  Vn-  Similar  notation  is  used  for  other  sets  of  variables. 
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n  m 

(53)         L(y,  x;  p)  =  U(yv  .  .  .,  y,)+  2&  Eft,  (*,)-*+&] 

i=l  j=l 

s  m 

+  IfiLigiA^  +  iil 

i=n+l        j=l 

n 

=  [U{yv.  .-,yn)  -iPiVi] 

m       s  s 

+  XH$igiA*i)l  +  I$iti- 

3=1  i=l  i=l 

The  Lagrangian  has  now  been  expressed  as  the  sum  of  (mJr2) 
terms.  The  first  depends  on  yv  .  .  .,  yn,  but  not  on  any  of  the 
x/s.  Each  of  the  next  m  is  of  the  form, 

s 

i=i 

say,  and  depends  only  on  one  variable,  the  scale  xi  of  the  jth 
process.  Finally,  the  last  term  is  a  constant.  To  maximize  a 
sum  of  functions  each  depending  on  a  different  set  of  variables 
involves  only  maximizing  each  of  them  separately.  Condition 
(50.1)  can  then  be  written, 

n 

(54.1)  the  function  U(yv  .  .  .,  yn)  —  2  Pi  y%  attains  its  maximum 

i=i 
with  respect  to  yv  .  .  .,  yn  at  yi  =  y4  {i  =  1,  .  .  .,  n); 

(54.2)  for  each  /,  the  function  nj(xj)  attains  its  maximum  at 

JC  j  —  JO  j . 

The  economic  and  institutional  interpretation  of  the  above 
criteria,  particularly  (52)  and  (54),  as  is  by  now  familiar,  requires 
identifying  the  Lagrange  multiplier  associated  with  a  commodity 
restraint  as  the  price  of  that  commodity.  Thus  pt  maybe  thought 
of  as  the  price  of  commodity  i.  Condition  (52)  then  states  that 
the  equilibrium  price  is  zero  for  any  commodity  for  which  there 
is  an  excess  of  supply  over  demand  at  equilibrium.  Condition 
(54.1)  says  that  the  final  demands  are  to  be  chosen  so  as  to 
maximize  the  difference  between  their  utility  and  their  cost  (at 
equilibrium  prices). 

In  discussing  (54.2),  first  note  that,  by  definition,  n^Xj)  is 
the  profit  (calculated  at  equilibrium  prices)  obtained  by  running 
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the  jth  process  at  scale  xy  Then  (54.2)  states  that  each  process 
should  be  run  at  a  scale  which  will  yield  maximum  profit. 

Conditions  (54),  particularly,  bring  out  the  possibility  of 
decentralization  through  the  price  system.  Given  the  set  of 
equilibrium  prices,  the  choice  of  the  final  demands  can  be  made 
separately  from  the  choices  of  the  process  scales,  and,  further, 
the  latter  can  be  made  separately  from  each  other.  Let  us 
imagine  that  a  manager  is  appointed  for  each  process  and  that  a 
helmsman  is  charged  with  choosing  final  demands.48)  For  any 
given  set  of  prices,  each  manager  is  instructed  to  choose  that 
scale  which  will  lead  to  a  maximum  of  the  profits, 

(55)  2  &*«(*,)  =  *,{*:  P). 

At  the  same  time,  the  helmsman  is  instructed  to  determine  the 
level  of  the  final  demands  so  as  to  maximize  the  difference 
between  utility  and  costs,  U(ylt  .  .  .,  yn)  —  ]T?=i  ttVi-  For  each 
desired  commodity  i,  the  choice  of  xi  by  each  manager  determines 
an  output  (possibly  negative)  gw(a^)  by  the  jth  process,  while 
the  helmsman's  decision  includes  one  for  y{.  Thus  net  demand 
for  the  desired  commodity  i  for  final  and  intermediate  use  is, 

m 

Vi-lLgij{xi)'> 

3=1 

for  the  equilibrium  prices,  conditions  (51)  and  (52)  require  that 
this  net  demand  does  not  exceed  the  initial  supply  (t  and  that,  if 
the  two  are  unequal,  the  price  pt  must  be  zero  (if  the  optimum 
can  be  reached  without  using  the  full  initial  supply,  then  the 
commodity  is  a  free  good).  Similarly,  for  a  primary  commodity  *, 
the  net  demand, 

m 
3=1 

must  be  compared  with  the  initial  supply  ^;  at  equilibrium, 
the  net  demand  must  not  exceed  the  initial  supply,  and  the  price 
pi  must  be  zero  if  there  is  an  excess  supply. 

48 )  We  follow  the  terminology  of  Koopmans,  "Production  as  an  Efficient 
Combination  .  .  ,"  pp.  93 — 95,  as  developed  for  the  linear  case;  however,  the 
function  of  the  helmsman  is  somewhat  different  here. 
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The  elements  of  decentralization  here  are  clear.  For  a  given 
set  of  prices,  a  process  manager  need  know  only  the  prices 
and  the  technology  of  his  own  process  in  order  to  arrive  at  the 
optimal  level  for  his  process.  The  helmsman  need  only  know  the 
prices  of  the  desired  commodities  and  the  utility  function.  Finally, 
the  equilibrium  on  each  market  may  be  checked  separately; 
for  any  given  market,  the  test  requires  knowing  only  the  net 
demand,  which  is  an  aggregate  of  many  individual  decisions,  the 
initial  supply,  and  the  price. 

Theorem  9.  If  (U  —  C)  and  (P  —  C)  hold,  then  yv  .  .  .,  yn> 
and  xlt  .  .  .,  xm  are  final  demands  ylt  .  .  .,  yn  and  process  levels 
xv  .  .  .,  xn  which  maximize  the  utility  function  U(ylt  .  .  .,yn) 
subject  to  the  feasibility  constraints  if  there  exist  prices 
pv  .  .  .,  ps  such  that  the  following  conditions  are  satisfied: 

n 

(a)  yv  .  •  .,  yn maximize  the  difference,  U{yv  .  .  .,  y^—^piVi* 
between  utility  and  costs;  i=1 

s 

(b)  for  each  process  /,  xt  maximizes  the  profit,  ^pigafej)', 

(c)  for  each  desired  commodity  i,  the  aggregate  excess  of 

m 

supply  over  demand,  2£wfe)  +  $t  ~  Vi>  ^s  non-negative; 

3=1 

if  positive,  then  the  price  pi  is  zero. 

(d)  similarly,  for  each  primary  commodity  i,  the  aggregate 

m 

excess   of  supply  over  demand,   2  Sa  fe)  H~  %%>   *s  non" 

negative;  if  positive,  then  the  price  pt  is  zero. 

It  may  be  noted  that  there  may  be  more  than  one  maximum 
in  (a)  or  (b).  This  will  usually  be  the  case  in  (b)  if  the  functions 
g^  are  linear,  as  in  the  linear  programming  case.  In  general, 
in  this  case,  a  process  manager  will  find  a  range  of  scales,  each 
of  which  achieves  the  possible  profit,  but  only  some  scales  will 
in  fact  be  optimal  for  the  economy.  The  choice  by  the  manager 
of  a  scale  which  is  not  optimal  for  the  economy  as  a  whole  will 
be  revealed  by  a  violation  of  one  of  the  conditions  (c)  or  (d). 
The  validity  of  Theorem  9  is  not  affected  by  this  remark,  but 
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the  force  of  the  decentralization  argument  is  somewhat  weakened 
under  these  conditions. 


B.  An  Alternative  Static  Characterization  of  the  Optimal  Allocation.. 

Theorem  9  as  a  characterization  of  an  optimal  allocation  is,  of 
course,  closely  related  to  the  criteria  of  welfare  economics,  as 
given  by  Hotelling,  Lange,  Koopmans,  Allais,  Lerner,  and. 
Bergson.49)  However,  condition  (a)  is  somewhat  different  from 
the  analogous  characterization  of  Pareto  optima  when  there  are 
many  consumers.  In  the  latter  case,  each  consumer  is  faced  with 
a  constrained  maximization  problem,  that  of  maximizing  the 
utility  function  subject  to  a  budget  restraint.  Since  we  are  here 
treating  the  special  case  where  there  is  only  one  consumer,  there 
should  be  and  is  an  analogous  theorem,  with  (a)  replaced  by  a 
constrained  maximization. 

Let  M  =  ^t^xpiVi,  the  total  expenditure  of  the  helmsman  at 
equilibrium.  Suppose  he  is  now  told  to  maximize  utility  subject 
to  the  constraint  that  his  expenditures  be  M,  i.e., 

(56)  2PiVi  =  M. 

Since  the  result  of  a  maximization  is  not  affected  by  changing- 
the  maximand  by  a  constant,  the  problem  is  equivalent  to 
maximizing  U{yv  .  .  .,  yn)-M  =  U(yv  .  .  .,  y^-^iPiVi  ob- 
ject to  the  constraint  (56).  But  we  already  know  that  the  un- 
conditional maximum  for  the  last  maximization  is  yl=yi 
(i  =  1,  .  .  .,  n).  Since  this  solution  satisfies  the  constraint  (56), 
it  must  also  be  the  constrained  maximum.  Hence  (a)  is  equivalent 
to, 

(57)  Si,  •  •  .,  yn  maximize   U(ylt  .  .  .,  yn)   subject  to  the  con- 
straint (56). 


49 )  See  H.  Hotelling,  "The  General  Welfare  in  Relation  to  Problems  of 
Taxation  and  of  Railway  and  Utility  Rates,"  Econometrica,  Vol.  6  (1938), 
242 — 67;  Lange,  "Foundations,  ..."  op.  cit.;  Koopmans,  ch.  1  in  Three  Essays, 
op.  cit.;  M.  Allais,  Traite  d' Economic  Pure  (Paris:  Imprimerie  Nationale,  1943) 
III,  604 — 82;  Lerner,  op.  cit.;  A.  Bergson  (Burk),  "A  Reformulation  of  Certain 
Aspects  of  Welfare  Economics,"  Quarterly  Journal  of  Economics,  VIII  (1938)< 
310—334. 
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So  far,  this  is  not  very  interesting  since  the  condition  (57) 
requires  a  knowledge  of  M.  Now  let  us  recall  the  homogeneity 
properties  of  demand  and  supply  functions  derived  from  utility 
—  and  profit  —  maximization.  The  behavior  described  in  (57) 
is  that  of  the  consumer  in  the  usual  theory  of  demand.  If  M 
is  changed  to  any  value  M  and  the  prices  changed  in  the  same 
proportion,  the  choice  of  the  y/s  is  unaffected.  At  the  same  time, 
the  choice  of  xi  is  unaffected  by  a  change  of  all  prices  in  the  same 
proportion.  Hence,  if  M  is  replaced  by  any  number  M,  we  can 
choose  the  price  pt  so  that  (57)  and  (b)  of  Theorem  9  are  satisfied. 
In  (c)  and  (d),  the  prices  enter  directly  only  in  the  form  of 
conditions  for  zero  prices;  but  a  change  of  all  prices  in  the  same 
proportion  leaves  the  zero  prices  unaltered.  We  can  state, 
Theorem  9'.  Theorem  9  remains  valid  if  (a)  is  replaced  by, 

(a')ylf  .  .  .,  yn  maximize  U(yv  .  .  .,  yn)  subjects  to  the  budget 

restraint,  2?=i^*2/<  =  M,  where  M  can  be  chosen  arbitrarily. 

The  constrained  maximum  (a')  is,  as  usual  in  consumers' 
demand  theory,  invariant  under  monotonic  transformations  of 
the  utility  function.  It  can  thus  be  shown  that  Theorem  9', 
unlike  Theorem  9,  remains  valid  if  the  utility  function  is  to  be 
merely  quasi-concave,  rather  than  concave. 

C.  The  Economic  Meaning  of  the  Gradient  Method. 

We  will  now  apply  the  gradient  methods  of  section  E,  part  II 
to  the  resource  allocation  model  of  part  I,  section  C.  The  for- 
mulation is  parallel  to  the  application  of  Theorem  2  as  given 
in  section  A  of  this  part.  The  differential  equations  (43)  and  (44) 
then  become, 


(58)  dyjdt  = 

(59)  dxjdt  = 

(60)  dpjdi  = 


JO  if  Vi  =  0  and  {BU/dyt)  -  pt  <  0, 
[(dU/dyi)  —pi  otherwise; 

0  if  Xj  =  0  and  dnj\dxi  <  0, 

dni\dxi  otherwise; 

0  if  pi  =  0  and  gt  >  0, 

— gt  otherwise. 


In  (59),  n i  is  the  profit  evaluated  at  current  prices,  as  defined 
in  (55). 
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These  equations  can  be  obtained  by  directly  substituting  the 
particular  maximand  and  constraints  of  the  resource  allocation 
problem  into  the  general  gradient  method.  It  is  perhaps  more 
illuminating  to  notice  that  (58)  and  (59)  are  direct  dynamic 
counterparts  of  the  conditions  (a)  and  (b)  of  Theorem  9.  Once 
the  problem  of  the  helmsman  has  been  stated  as  an  unconstrained 
maximization,  it  is  natural  to  transform  the  static  condition  of 
finding  the  maximum  into  the  gradient  method  for  approaching 
it;  this  is  precisely  (58)  (compare  section  A.3,  part  II).  Equation 
(59)  has  precisely  the  same  relation  to  condition  (b)  as  (58)  has 
to   (a). 

But  of  course  we  now  assume  that  the  maximization  processes 
of  (58)  and  (59)  are  being  applied  at  any  set  of  prices  taken  as 
given  to  the  helmsman  and  the  managers,  not  merely  at  the 
equilibrium  set  of  prices.  As  the  final  demands  yt  and  process 
scales  xi  are  varied  in  accordance  with  (58)  and  (59)  respectively, 
the  prices  which  the  helmsman  and  the  process  managers  take 
as  given  are  themselves  being  varied  in  accordance  with  (60). 
The  meaning  of  (60)  has  already  been  explored  in  part  II, 
section  E. 

Thus  we  see  that  the  gradient  method  may  be  given  the 
following  institutional  interpretation.  The  helmsman,  taking  the 
prices  of  desired  commodities  as  given,  changes  each  final 
demand  at  a  rate  equal  to  the  difference  between  marginal 
utility  and  price,  except  that  if  the  final  demand  for  any  com- 
modity is  zero  and  the  marginal  utility  is  less  than  the  price, 
the  final  demand  remains  at  zero  (since  negative  final  demands 
have  no  meaning).  Hence  the  final  demand  for  a  commodity  is 
increased  if  marginal  utility  exceeds  price  and,  if  not  already 
zero,  decreased  if  marginal  utility  is  less  than  price.  For  each 
process,  the  manager,  taking  all  prices  as  given,  changes  the  scale 
of  his  process  at  a  rate  proportional  to  its  marginal  profitability, 
except  that  if  the  scale  is  zero  and  the  marginal  profitability 
negative,  the  scale  remains  at  zero.  Thus  a  process  increases 
in  scale  if  the  marginal  profitability  of  expansion  is  positive 
and  decreases  in  the  opposite  case  if  not  already  zero. 

The  choice  by  a  manager  of  his  process  scale  determines  the 
output  or  input  of  each  commodity  in  the  process.  For  any  one 
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commodity,  total  the  outputs  (taking  inputs  as  negative  out- 
puts) for  all  processes,  add  the  initial  supply  $t,  and,  in  the  case 
of  a  desired  commodity,  subtract  the  final  demand  determined 
by  the  helmsman.  The  result  is  the  excess  supply,  denoted  by 
git  of  the  desired  and  primary  commodities  respectively.  We  may 
imagine  for  each  commodity  a  custodian  (again  in  Koopmans's 
terminology)  who  varies  the  price  of  the  commodity  at  a  rate 
proportional  but  opposite  in  sign  to  the  excess  supply,  with  the 
qualification  that  if  the  price  is  zero  and  the  excess  supply 
positive  the  price  remains  at  zero.  This  instruction  is  the  well- 
known  "law  of  supply  and  demand":  the  price  of  a  commodity 
rises  if  demand  exceeds  supply,   falls  in  the  opposite  case. 

The  instructions  to  the  helmsman,  the  managers,  and  the 
custodians  represent  a  high  degree  of  decentralization  of  the 
information-gathering  and  decision-making  functions.  The  helms- 
man need  only  know  the  utility  function  and  the  prices  of  desired 
commodities  to  carry  out  his  rule  (58).  The  manager  of  a  process 
need  know  only  its  technology,  as  represented  by  the  functions 
gijiXj),  and  the  prices  of  the  commodities  entering  it.  A  custodian 
need  only  know  the  aggregate  difference  between  demand  and 
supply  on  his  market  (not  the  offers  and  demands  of  individual 
processes  or  of  the  helmsman)  and  the  price  of  his  commodity. 


D.  The  Conditions  for  Validity  of  the  Gradient  Method. 

The  solution  of  the  system  of  differential  equations  (58 — 60) 
will  not  converge  for  any  arbitrary  set  of  functions  U(yv  .  .  .,  yn) 
and  gijiXj).  Not  even  the  concavity  of  all  these  functions  suffices. 
As  was  seen  in  part  II,  section  D,  if  all  the  functions  involved 
are  linear,  the  gradient  method  will  lead  to  indefinite  oscillations 
in  some  or  all  of  the  variables  with  no  convergence  to  the  op- 
timum solution. 

Theorem  5  supplies  a  sufficient  condition  for  convergence  in 
the  general  constrained  maximization  problem  but  one  that, 
unfortunately,  is  not  applicable  to  the  resource  allocation 
problem.  The  reason  for  inapplicability  is  that  the  maximand 
U(y±>  •  •  -,  yn)  does  not  contain  all  the  variables  with  respect  to 
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which  it  is  maximized  and  hence  cannot  be  strictly  concave  in 
all  the  variables. 

To  obtain  the  desired  results  we  will  now  strengthen  assump- 
tion [U  —  C)  to, 
(U  —  SC)     U(yv  .  .  .,  yn)  is  strictly  concave  in  yv  .  .  .,  yn. 

The  justification  for  (U  —  SC)  is  similar  to  that  for  (U  —  C). 
In  consumers'  demand  theory  it  is  usual  to  assume  not  merely 
that  the  indifference  surfaces  are  convex  to  the  axes  but  also 
that  they  are  not  linear  or  planar  and  have  no  linear  segments 
in  them.  This  requirement  insures  that  the  demand  functions 
are  well-defined  (i.e.,  single- valued)  since,  for  any  budget  plane, 
the  indifference  surface  tangent  to  it  has  only  one  point  in 
common  with  it.  We  may  state  this  as  a  requirement  on  the 
utility  indicator  in  the  terminology  of  Definition  3. 
(U  —  SQC)  U(yv  .  .  .,  yn)  is  a  strictly  quasi-concave  function. 
Under  suitable  regularity  conditions50),  if  (U  —  SQC)  holds, 
there  is  a  monotonic  transform  wThich  is  strictly  concave.  There- 
fore assumption  (U  —  SC)  means  essentially  that  a  suitable  one 
of  the  utility  indicators  has  been  chosen  in  the  definition  of 
condition   (58)  of  the  gradient  method. 

While  assumption  (U  —  SC)  is  not  sufficiently  strong  to  insure 
that  all  the  variables  will  converge  to  their  optimal  values  when 
the  gradient  method  is  applied,  it  is  sufficient  to  insure  that  the 
most  important  ones  do. 

Theorem  10.  If  (U  -  SC)  and  (P  -  C)  hold,  and  the 
variables  yit  xp  fi  are  varied  in  accordance  with  the  gradient 
method  defined  by  equations  (58 — 60),  then,  for  each  desired 
commodity  i,  the  final  demand  yi  and  the  price  ^>i  converge 

to  their  optimal  values  yi  and  f4  (i  =  1, ,  n).  The  process 

scales  xi  and  the  prices  of  the  primary  commodities  ft{ 
[i  =-n  +  1, . . .,  s)  may  oscillate  indefinitely  but  cannot 
diverge. 

Further,  for  any  desired  commodity  for  which  the  optimal 
final  demand  is  positive,  the  difference  between  supply  and 
demand  approaches  zero.  In  symbols,  for  any  i  for  which  ^  >  0, 

m 

tenb/t-2g«(*,)-itl  =  0. 

50 )   See  footnote  46. 
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Although  Theorem  10  assures  convergence  in  the  most  im- 
portant variables,  it  is  not  completely  satisfactory.  The  scales 
of  the  processes  must,  in  the  limit,  supply  the  equilibrium  final 
demands,  but  it  is  not  guaranteed  that  they  will  satisfy  the 
feasibility  constraints  with  respect  to  primary  commodities. 
The  process  scales  may  instead  oscillate  indefinitely  in  such  a 
way  that  the  demand  by  processes  for  a  primary  commodity 
oscillates  indefinitely  about  the  initial  supply.  Examples  can 
be  given  to  show  that  this  possibility  is  not  ruled  out  by  any 
assumption  thus  far  made. 

As  might  be  expected,  the  possibility  of  oscillations  is  con- 
nected with  linearity  of  the  production  processes.  We  can  make 
the  following 

Remark.  Even  if  some  of  the  processes  are  linear,  oscillations  in 
the  process  scales  can  occur  only  "by  accident",  in  the  sense 
that  special  relations  must  hold  among  the  input-output  coef- 
ficients which  define  the  different  linear  processes. 

It  follows  that  the  gradient  method  in  general  provides  a 
satisfactory  solution  of  the  resource  allocation  problem  under  the 
assumptions  of  Theorem  10. 

In  any  case,  the  absence  of  linearity  is  a  sufficient  condition 
for  convergence  in  all  the  variables.  We  need  to  assume  a 
sharpened  version  of  (P  —  C);  instead  of  merely  postulating 
non-increasing  returns,  we  will  wish  to  assume  strictly  diminishing 
returns.  In  mathematical  terms,  this  means  that  the  functions 
gij(%j)  defining  a  process  /  are  all  strictly  concave.51)  However, 
we  do  not  want  to  assume  that  the  functions  gw(a^)  are  strictly 
concave  for  all  i  and  /  because  that  would  imply  that  every 

51)  The  functions  gij{xt)  are  here  considered  to  be  strictly  concave  as  functions- 
of  the  process  scale  x}-.  In  nonlinear  processes,  there  may  be  more  than  one  natural 
definition  of  a  scale.  From  one  point  of  view,  any  strictly  monotonic  transfor- 
mation of  a  scale  is  itself  a  scale,  since  it  serves  equally  well  as  a  parameter 
in  defining  the  different  levels  of  the  process.  However,  the  previous  theorems 
have  shown  that  there  is  an  advantage  in  choosing  the  scale  so  that  the  functions 
gij{Xj)  are  concave.  If  this  condition  does  not  uniquely  specify  the  choice  of 
a  scale,  then  the  scale  can  be  so  chosen  that  all  the  functions  gti  which  enter 
the  process  can  be  chosen  strictly  concave. 

It  may  be  worth  remarking  that  the  condition  that  it  be  possible  to  choose 
strictly  concave  input  and  output  functions  is  stronger  than  the  condition 
that  the  process  not  be  linear.  The  first  condition  requires  that  the  process, 
have  no  linear  relations  between  any  input  and  any  output. 
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commodity  enters  into  every  process,  either  as  input  or  as 
output.  We  therefore  assume  the  following  less  stringent  sharpen- 
ing of  (P  —  C):  (P  —  SC).  For  each  commodity  i  and  process  /, 
the  function  gtj(x})  is  either  strictly  concave  or  identically  zero. 

We  can  then  state, 

Theorem   11.   If   (U  -  SC)   and   (P  -  SC)  hold,  and  the 

variables  yit  xj}  and  pi  are  varied  in  accordance  with  the 

gradient  method  defined  by  equations  (58 — 60),  all  variables 

converge  to  their  equilibrium  values.52) 

E.  The  Price- Adjustment  Method. 

We  will  now  apply  the  price-adjustment  method  presented  for 
constrained  maximum  problems  in  part  II,  section  F,  to  the 
resource  allocation  problem.  The  formulation  is  obvious;  the 
price-adjustment  is  the  limiting  form  of  the  gradient  method 
of  (58 — 60),  where  the  differential  equations  (58)  and  (59)  are 
replaced  by  the  conditions  that  for  any  given  set  of  prices  the 
helmsman  chooses  that  set  of  final  demands  which  maximizes 
the  difference  between  utility  and  costs  and  the  manager  of 
each  process  chooses  that  scale  which  maximizes  profits  (55). 
The  totality  of  these  decisions  determines  supply  and  demand 
on  each  market;  the  custodian  varies  price  in  accordance 
with  (60). 

It  must  be  recalled,  as  already  seen  in  part  II,  section  F, 
that  the  instructions  to  the  managers  to  choose  maximizing 
values  for  their  scales  implies  that  there  exists  a  maximum  and 
that  it  is  unique.  The  first  condition  is  satisfied  when  prices  are 
sufficiently  close  to  their  equihbrium  values  but  not  necessarily 
everywhere;  the  second  condition  requires  that  the  profit 
function  is  strictly  concave,  which  is  guaranteed  if  (P  —  SC) 
holds.  Similarly,  the  helmsman  has  a  unique  maximum  in  the  final 
demands  when  prices  are  close  to  equilibrium  and  the  utility 
function  is  strictly  concave.  We  shall  therefore  assume  that 
(P  -  SC)  and  (U  -  SC)  are  both  valid. 

For  any  given  set  of  prices,  plf  .  .  .,  fis,  let  the  helmsman  choose 
yv  .  .  .,  yn  so  as  to  maximize  U{yv  .  .  .,  yn)  —  Jd  PiVi  and  the 

52)  Mathematical  proofs  of  the  results  in  this  section  will  be  found  in  K. 
Arrow  and  L.  Hurwicz,  "Gradient  Method  for  Resource  Allocation,"  op.  cit. 
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manager  of  the  jth  process  choose  xj  so  as  to  maximize  7tj(Xj). 
Denote  the  choice  of  helmsman  as  a  function  of  the  ft/s  by, 

Vl(Pl>  •  •  ->Pn)>  •  •  '>Vn(Pl>  •  •  >>Pn)- 

Vi(Pi>  •  •  •»  fin)  is  *ne  final  demand  function  for  the  ith  desired 
commodity.  The  value  of  x3-  chosen  by  a  manager  determines 
the  net  output,  g^(^),  of  the  ith  commodity  by  the  /th  process; 
as  a  function  of  the  prices,  it  may  be  denoted  by  y^ipx,  .  .  .,  p8), 
the  supply  (demand  if  negative)  of  the  ith  commodity  by  the 
jth  process>  The  aggregate  excess  supply  for  the  ith  commodity, 
as  a  function  of  prices,  is  then, 

m 

(61.1)  yi{K  ■  ■  -,  Ps)=lYiAK  ■  ■  ;  Ps)+£i-%(Pl,  ■  ■  ;  Pn) 

1=1  (*'  =  1, ..'.,»); 

m 

(61.2)  Yi(plt  .  .  .,  pa)=2yuiPv  •  •  -  Ps)+Zi      {i  -  n  +  1,  .,  .,  s). 

3=1 

In  this  notation,  the  equilibrium  conditions  for  an  optimal 
allocation,  as  given  in  Theorem,  9  are, 

(62)  y€(pv  .  .  .,ps)  >  0    (i  =  1,  .  .  .,  s);   for   any   i  such   that 
7i(Pi>  .••>£*)><>>  Pi  =  o> 

that  is,  the  equalization  of  supply  and  demand  with  the  necessary 
qualification  for  free  commodities.  The  equilibrium  values  of 
final  demands  and  process  scales  are  determined  from  the  equi- 
librium prices  by  the  maximization  that  determined  the  supply 
and  demand  functions.  yt  =  y^px,  .  .  >,pn),  and  xi  is  the  profit- 
maximizing  choice  which  determined  the  functions  yiP  evaluated 
for  equilibrium  prices. 

The  dynamics  of  the  price-adjustment  method  are  then  simply 
stated: 

(63)  dpjdt  =  (°  if  ft  =  °  a"f  *&>  '.  •  -  *>  >  °' 

\-Yifrv  •  •  ■>  Ps)  otherwise. 

Since  the  price-adjustment  method  is  simply  a  limiting  form 
of  the  gradient  method,  the  conditions  for  its  validity  are  the 
same. 

Theorem  12.  The  price-adjustment  method  defined  by  (63) 

converges  under  assumptions  (U  —  SC)  and  (P  —  SC). 
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F.  Remarks  on  an  Alternative  Form  of  the  Price-Adjustment 
Method. 

The  static  characterization  of  an  optimum  given  in  section  B 
suggests  the  following  alternative  form  of  the  price-adjustment 
method.  The  supply  and  demand  functions  of  the  processes, 
yij(fti>  -  •  •>  Ps)>  are  generated  in  the  same  way  as  in  part  E, 
but  the  final  demand  function,  fji(j>v  .  .  .,  pn),  are  defined  by 
the  maximization  of  U(ylf  .  .  .,  yn)  subject  to  the  budget  restraint, 
2*Li  Pi  Vi  =  M.  (The  dependence  of  the  final  demand  fjt  on  M 
is  not  indicated  in  the  notation,  since  M  is  taken  as  a  parameter 
and  remains  fixed  throughout  the  adjustment  process.)  Then  the 
definition  (61)  of  excess  supply  is  altered  only  by  replacing 
Vi(plt  .  .  .,  pn)  in  (61.1)  by  UPv  •  •  -.  Pn)l  let  UPi>  •  •  ..  Ps)  be 
the  excess  supply  in  the  new  system.  Finally  the  dynamics  are 
supplied  by  (62),  with  yt(plt  .  .  .,  ps)  replaced  by  y,€{p,v  .  .  .,  ps). 

In  this  form,  the  system  is  a  special  case  of  a  competitive 
economy,  in  which  the  productive  units  maximize  profits  at 
any  given  level  of  prices,  the  consumers  (in  this  case  there  is 
only  one)  maximize  utility  at  any  given  level  of  prices,  and 
prices  vary  according  to  the  law  of  supply  and  demand.  This 
is  the  model  sketched  by  Walras  and  formulated  more  precisely 
by  Samuelson.  The  conditions  under  which  the  equilibrium  of 
such  an  economy  is  stable  are  by  no  means  well  known,  but  a 
number  of  sufficient  conditions  have  been  established.53) 

IV.  MODIFIED  MARKET  MECHANISMS  IN  ABSENCE  OF 
DECREASING  RETURNS 

In  this  part,  we  will  treat  the  modifications  of  the  market 
mechanism  which  might  cope  with  the  problems  of  resource 
allocation  arising  under  increasing  or  constant  returns.  The 
treatment  must,  of  course,  be  much  more  tentative.  The  pos- 
sibility of  allocative  mechanisms  which  achieve  optimal  allocation 
under  increasing  returns  and  still  have  some  measure  of  decen- 
tralization is  only  beginning  to  be  explored. 

63)  See  K.  Arrow  and  L.  Hurwicz,  "On  the  Stability  of  Competitive  Equilibrium 
I,"  Econometrica,  Vol.  26  (1958),  522-552;  K.  Arrow,  H.  D.  Block,  and  L. 
Hurwicz,  "On  the  Stability  of  Competitive  Equilibrium  II,"  ibid,  Vol.  27 
(1959),  82-109. 
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A.  General  Remarks. 

As  we  have  seen  in  several  places  in  part  II,  the  absence  of 
concavity  conditions  on  the  functions  involved  has  two  con- 
sequences for  the  characterization  of  maxima  (constrained  or 
unconstrained):  the  first-order  conditions  do  not  completely 
distinguish  maxima  from  other  stationary  points,  and  in  any 
case  do  not  in  any  way  distinguish  global  from  merely  local 
maxima.54)  The  latter  problem  cannot  be  dealt  with  by  any 
static  characterization  which  is  based  on  derivatives.  Correspond- 
ingly, no  variation  of  the  gradient  method,  which  is  based  on 
moving  uphill  as  measured  solely  by  local  variations,  can  be 
expected  to  insure  arrival  at  the  highest  of  several  peaks;  at 
best,  only  convergence  to  a  local  maximum  can  be  expected. 

A  simple  illustration  is  provided  by  an  economy  with  one 
output,  one  input,  and  two  processes  both  operating  under 
increasing  returns.  Let  xt  be  the  input  into  the  ith  process, 
fi(Xi)  the  output  of  the  ith  process,  and  £  the  total  amount  of 
input  available,  so  that  xx  +  #2  =  ?•  We  assume  /<(#*)  >  0 
fi(xi)  >  0-  We  wish  to  maximize  yf(x1)  =  fifa)  -f-  f2(£  —  x±). 
We  then  have 

We  locally  maximize  ip{x^)  subject  to  0  ^  x2  g  f.  If  the  maxi- 
mizer  xx  satisfies  0  <  x±  <  1,  then  we  must  have  ip'{xi)  =  0, 
V>"(#i)  =  &  Bu*  under  the  assumptions  made  ^"(a^)  >  0  for 
all  xv  Hence  either  xx  =  0  (and  x2  =  |)  or  xx  =  £  (and  x2  =  0) ; 
in  any  case  one  of  the  two  processes  is  at  zero  level. 

Now  suppose  that  /[{£)  >  f2(0)  and  also  f'2(£)  >  /J(0).  Then 
xp'(£)  >  0  and  y/(0)  <  0,  so  that  a^  =  £  and  a^  =  0  are  both 
local  maxima.  (That  this  can  happen  is  seen  from  the  example 

fiixi)  =  &ix%  ai  >  °0  If  /i(£)  ¥=  f2^)>  onty  one  °f  tne  two  local 
maxima  is  a  global  maximum.  (Take  ax  ^  a2  in  the  preceding 
example.)  Hence  we  may  expect  to  encounter  cases  of  local 
maxima  which  are  not  global. 

We  will  suppose  from  now  on  that  we  know  in  a  general  way 

64)  See  part  II,  sections  A.l  and  A.5,  and  the  Remark  to  Theorem  1  in 
section  B 
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where  the  global  maximum  is;  the  dynamic  problem  will  be  that 
of  locating  it  precisely.  Within  the  framework  of  gradient 
methods,  nothing  better  can  be  expected. 

In  any  case,  we  will  want  to  seek  processes  that  do  not  converge 
to  stationary  points  that  are  not  even  local  maxima,  while  if  we 
do  start  in  the  neighborhood  of  a  local  maximum  we  wish  our 
approximation  methods  to  converge  to  it.  These,  then,  are  the 
criteria  by  which  we  will  judge  modifications  of  the  market 
mechanisms  which  seek  to  yield  optimal  resource  allocations  under 
increasing  returns.  Our  mathematical  tools  are  the  static 
characterizations  of  part  II,  section  C,  and  their  dynamic 
counterparts,  Theorems  6  and  7  of  part  II,  section  E. 

B.  The  Unmodified  Lagrangian  Conditions. 

For  static  conditions  for  an  optimal  allocation,  we  start  with 
the  Kuhn-Tucker  Theorem  1,  which  has  in  fact  been  the  basis 
of  discussions  of  optimal  allocation  to  date.  The  application  of 
Theorem  1  to  the  resource  allocation  problem  is  entirely  parallel 
to  that  of  Theorem  2  in  the  case  where  the  concavity  conditions 
necessary  for  the  latter,  as  given  in  Theorem  9.  Each  of  the  four 
conditions  there  has  a  parallel  in  the  more  general  case. 

The  analogue  of  condition  (a)  is  that, 

(64)  UVi  ^  fr  (i  =  1,  .  . .,  n)\  if  UVi  <  pif  then  yi  =  0. 

If  we  retain  the  concavity  conditions  (U  —  C)  for  the  utility 
function,  then  the  relations   (64)  are  necessary  and  sufficient 
for  a  maximum  of  the  difference,  U(yv  .  .  .,  yn)  —  ^Jl^piVi,  so 
that  condition  (a)  of  Theorem  9  remains  valid. 
The  analogue  of  condition  (b)  is  that, 

(65)  dli^dXj  ^  0  for  xj  =  xf,  if  dnj\dxj  <  0,  then  xi  =  0. 

In  (65),  the  function  n^x/j  is  the  profit,  calculated  at  equi- 
librium prices,  attached  to  the  jih  process,  as  defined  just 
preceding  equations  (54). 

Finally,  conditions  (c)  and  (d)  remain  valid. 

Theorem  13.  If   (U  —  C)  holds,  then  the  following  con- 
ditions are  necessary  for  an  optimal  resource  allocation: 

(a)  the  helmsman  chooses  the  final  demands  yv  .  .  .,  yn  so 
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as  to  maximize  the  difference,  U  (yv  .  .  .,  yn)  —  ^f^ipiUi, 
between  utility  and  costs  calculated  at  equilibrium  prices; 

(b)  each  process  manager  chooses  a  process  scale  xi  such  that 
dnJdXj  ^  0,  but  the  strict  inequality  can  only  hold  if 
Xj  —  0; 

(c)  for  each  commodity  i  (i  ==  1,  .  .  .,  s),  the  excess  supply 
g{  evaluated  at  equilibrium  final  demands  and  process 
scales  must  be  non-negative;  if  positive,  the  price  p—0.55) 

The  difference  in  conditions  for  optimal  resource  allocation 
between  the  general  case  discussed  here  and  that  in  which 
concavity  assumptions  are  made  about  production  as  well  as 
consumption  is  solely  that  between  condition  (b)  of  Theorem  13 
and  condition  (b)  of  Theorem  9.  The  former  requires  that  at 
equilibrium  prices  the  marginal  profitability  of  each  process  be 
non-positive;  it  can  be  negative  only  if  the  process  is  operated 
at  zero  scale.  The  last  clause  is  perhaps  more  intuitively  meaning- 
ful in  its  contrapositive  form: 

(66)  if  the  marginal  profitability  is  positive  at  zero  output, 
then  the  optimal  value  of  the  process  scale  must  be 
positive. 

In  a  general  way,  the  non-negativity  clause  has  been  recognized 
in  the  literature;  it  is  well  known  that  one  of  the  chief  problems 
in  optimal  resource  allocation  under  increasing  returns  is  determin- 
ing which  processes  are  operated  at  some  positive  level.  Statement 
(66)  supplies  a  sufficient,  though  not  necessary,  condition. 

The  parallelism  between  Theorems  9  and  13  suggests  a  supply- 
and-demand  interpretation.  The  demand  functions  of  the 
helmsman  are  defined  precisely  as  before.  For  a  process  manager, 
however,  the  supply  or  demand  for  each  product  is  defined  as  a 
function  of  all  prices  by  first  choosing  the  process  scale  xj  so 
that  marginal  profitability  is  zero  at  the  given  set  of  prices 
(or  negative  at  zero  scale)  and  then  finding  the  corresponding 
values  of  g«  fo).  The  equilibrium  prices  are  then  defined  in  the 

55)  It  may  be  remarked  that,  analogously  to  Theorem  9,  Theorem  13  remains 
valid  when  condition  (a)  is  replaced  by  condition  (a')  of  Theorem  9';  see  part 
III,  section  B.  Theorem  13  is  identical  with  the  propositions  of  Hotelling,  Lange, 
and  Allais,  op.  cit.  footnote  43,  except  for  the  emphasis  on  the  non-negativity 
conditions. 
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usual  way  as  those  which  equate  supply  and  demand  (or  lead 
to  an  excess  supply  with  zero  price).56)  This  interpretation  is 
not  as  useful  as  the  corresponding  one  in  the  case  where  the 
concavity  conditions  on  production  hold.  First,  for  any  given 
set  of  prices  there  is  generally  more  than  one  value  of  the  process 
scale  which  satisfies  the  stated  conditions.  Under  increasing 
returns,  the  marginal  profitability  can  easily  increase  from  a 
negative  value  at  zero  output  through  zero  to  positive  values. 
Then  both  zero  and  the  value  of  the  process  scale  which  makes 
the  marginal  profitability  zero  satisfy  the  condition.  If  there  is 
ultimately  a  phase  of  diminishing  returns,  the  marginal  profi- 
tability may  again  become  zero,  so  there  can  be  three  possible 
values  of  the  process  scale.  Even  more  complicated  possibilities 
cannot  be  excluded.  The  corresponding  supply  and  demand 
functions  are  thus  multi-valued  and  not  well  defined. 

A  second  difficulty  closely  related  to  the  first  is  that  the 
system  as  a  whole  will  in  general  have  multiple  equilibria.57) 
We  have  already  seen  that  this  phenomenon  is  usual  in  applica- 
tions of  Theorem  1;  see  the  Remark  following.  Some  of  these 
equilibria  will  not  even  be  local  optima.58) 

If  we  specify  in  advance  a  neighborhood  of  the  optimal 
allocation  and  consider  only  alternatives  within  it,  these  two 
difficulties  will  in  general  disappear,  and  Theorem  13  provides 
a  satisfactory  static  characterization.  The  next  problem  is  the 
determination  of  a  dynamic  approximation  method  which 
converges  to  the  optimum  and  is  consistent  with  decentralization. 
Since  the  proper  aim  of  a  process  manager  may  be  to  minimize 
rather  than  maximize  profits,  the  dynamic  system  of  Theorem  10 
cannot  apply.  The  above  supply-and-demand  interpretation 
suggests  a  model  in  which  the  dynamic  element  is  simply  the 

66)  This  formulation  is  employed  by  Lange,  Socialism,  pp.  81 — 82. 

57 )  This  possibility  is  admitted  by  Lange  but  regarded  as  exceptional;  see 
Socialism,  pp.  82,  69 — 70. 

58)  There  is  a  third  problem,  of  a  type  which  we  have  not  stressed  in  this 
paper.  If  the  optimal  process  scale  is  such  that  the  marginal  profitability  is 
changing  from  negative  values  to  positive  ones,  then  the  position  is  one  of 
minimum,  not  maximum  profits.  While  this  is  not  a  difficulty  from  the  point 
of  view  of  formal  rules  of  operation,  the  incentives  to  the  process  manager  are 
clearly  a  good  deal  less  satisfactory  than  in  the  concave  case  where  social  optima 
correspond  to  profit  maxima. 
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adjustment  of  prices  according  to  supply  and  demand.  The 
simplest  form  is  the  system  studied  in  Theorem  12,  with  the  new 
interpretation  given  to  the  supply  and  demand  functions  of 
processes;  in  this  system  price  changes  at  a  rate  proportional 
to  the  negative  of  excess  supply.  We  will  show  by  example  that 
such  a  dynamic  system  can  be  unstable.  However,  in  section  E 
below  we  will  show  that  the  price-adjustment  model  will  be 
stable  with  more  complicated  rules  for  price  changes. 

Let  there  be  two  commodities  and  one  process.  Commodity  1 
is  a  desired  commodity  with  no  initial  supplies.  Commodity  2 
is  a  primary  commodity  with  an  initial  supply  of  1.  The  one 
process  has  commodity  2  as  input  and  commodity  1  as  output. 
If  we  use  the  input  as  the  process  scale,  xv  the  output  is  assumed 
to  be  sbJ/2,  so  that  the  process  displays  increasing  returns.  We 
assume  that  the  single  desired  commodity  1  is  desired  at  all 
levels.  Then  from  an  ordinalist  point  of  view  the  utility  function 
can  be  any  strictly  increasing  function  of  the  final  demand  yv 
We  choose  in  particular  a  strictly  concave  utility  function, 

(67)  U(yi)  =  (loSyi)l2. 

In  our  notation,  the  above  model  can  be  written 

(68)  g11{x1)  =  xl/2,  gn(xj)  =  -xv 

(69)  fa  =  0,  i2  =  1. 

Let  us  derive  the  supply  and  demand  functions  in  the  notation 
of  part  III,  section  E.  First,  the  demand  function  of  the  helms- 
man is  obtained  by  maximizing  the  difference,  ^{y-^—p^y^ 
so  that 

(70)  Kfa)  =  l/2fiv 

The  process  manager  chooses  xx  to  make  the  marginal  profitabili- 
ty zero.  Since  i^fa)  =  p1g11{x1)  +  p2g2iixi)  =  (Pixil2)  ~  Pzxv 
dn1jdx1  =  pxxx  —  p2  =  0,  so  that  x1  =  pJPv  This  choice  of 
process  scale  generates  the  following  excess  supply  functions  for 
the  two  commodities  by  substitution  into   (68): 

(71)  ynifo  p2)  =  P\ml  72i(Pv  h)  =  -{P2IP1Y 

The  optimal  allocation  is  obvious  by  inspection;  since  the 
output  is  always  desired  and  is  an  increasing  function  of  the 
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input,  we  simply  set  x1  =  1  and  therefore  yx  =  1/2.  If  we  start 
in  a  neighborhood  of  the  optimum,  we  can  disregard  non- 
negativity  considerations.  The  price-adjustment  model  analogous 
to  that  of  part  III,  section  E,  becomes, 

(72)  dpjdt  =  -7l(pv  p2)  =  ni{j>i)  -  7u(Pv  k)  -  h 

(73)  dpjdt  =  -y2(pv  p2)  =  -y2i(pv  ft)  -  h 

=  (PM  -  i. 

The  equilibrium  point  of  this  system  can  be  found  by  setting 
the  right-hand  sides  equal  to  zero,  that  is,  yx{px>  p2)  =  0, 
y2(p1,p2)  =  0.  From  (73),^  —  p2  at  equilibrium;  by  substitution 
into  (72),  we  see  that  p1  =  p2=  1.  To  investigate  the  local 
stability  of  (72 — 73).  we  find  the  matrix  of  partial  derivatives 
of  the  right-hand  sides  of  (72 — 73)  with  respect  to  p±  and  p2, 
evaluated  at  the  equilibrium  point;  the  condition  for  stability 
is  that  the  real  parts  of  the  characteristic  roots  of  this  matrix 
be  negative.59)  Straightforward  calculation  shows  that  the 
matrix  is. 


K  "!)■ 


The  characteristic  equation  is  a  quadratic  with  roots  [3  ±  Vl7]/4, 
so  that  one  characteristic  root  is  positive,  and  the  system 
(72—73)  is  unstable.60) 

C.  A  Digression  on  Marginal-Cost  Pricing. 

Suppose  that  the  jth  process  has  a  single  output,  say  com- 

59)  See  Samuelson,  op.  tit.,  pp.  270 — 74. 

60)  There  is  another  gradient  method  which  corresponds  to  Theorem  13 
(or,  more  generally,  to  Theorem  1)  and  which  does  converge  in  general  to  a 
local  maximum.  It  is  defined  by  moving  in  a  direction  which  is  as  close  to  the 
gradient  of  the  unconstrained  maximization  as  possible  compatible  with 
satisfying  the  constraints  at  all  times.  In  the  resource  allocation  model,  this 
would  require  that  the  excess  supply  for  each  commodity  would  have  to  be 
zero  (apart  from  free  goods)  at  all  points  in  the  approximation  process,  not 
merely  at  equilibrium.  There  seems  no  way  of  satisfying  this  requirement  in  a 
decentralized  fashion.  For  this  gradient  process,  see  G.  E.  Forsythe,  "Computing 
Constrained  Minima  with  Lagrange  Multipliers,"  Journal  of  the  Society  for 
Industrial  and  Applied  Mathematics,  Vol.  3  (1955),  173 — 78;  Arrow  and  Solow, 
op.  cit.,  sec.  3. 
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modity  1,  and  that  the  scale  of  the  process  is  taken  to  be  that 
output.  Then  g^fo)  =  xjt  and  —  g^(%)  is  the  input  of  commodity 
i  needed  to  produce  one  unit  of  commodity  1  (i  =  2,  .  .  .,  s). 
Clearly, 

(74)  dnJdXf  =p±-  2  Pl-Ml 

Since  —  g'^ipCj)  is  the  increase  in  input  i  per  unit  increase  in 
output,  (74)  may  be  interpreted  in  familiar  fashion;  the  marginal 
profitability  equals  price  less  marginal  cost.  Condition  (b)  then 
says  that  at  the  optimum  price  equals  marginal  cost,  except 
that  for  a  process  not  operated  at  all  price  may  be  less  than 
marginal  cost.61) 

We  have  thus  the  familiar  rule  of  marginal  cost  pricing  as  a 
condition  for  optimal  allocation  under  increasing  as  well  as 
diminishing  returns,  that  is,  that  for  a  process  operated  at  some 
positive  level  the  output  should  be  chosen  so  that  the  marginal 
cost  equals  the  price.  Under  diminishing  returns,  the  marginal 
cost  curve  will  be  rising,  and  the  rule  then  chooses  the  point  of 
maximum  profit.  Under  increasing  returns,  however,  the  marginal 
cost  may  be  falling,  and  the  (socially)  optimal  output  will  yield 
minimum  profit. 

Apart  from  the  non-negativity  condition,  then,  the  usual 
marginal-cost-pricing  condition  is  correct  when  applied,  as  in 
our  model,  to  individual  processes.  But  the  condition  is  usually 
applied  to  firms,  where  a  firm,  in  our  formulation,  is  a  single 
decision-making  unit  having  control  over  several  processes.  In 
this  form,  it  is  not  correct,  contrary  to  prevailing  opinion,  that 
the  optimal  allocation  corresponds  in  general  to  an  equality  of 
price  and  marginal  cost,  as  the  latter  term  is  usually  defined. 

To  see  this,  recall  that  the  cost  function  is  defined  in  economic 

61)  Actually,  this  interpretation  is  not  dependent  upon  identifying  the  scale 
of  process  with  the  output.  We  can  always  write, 

s 
dfijldxj  =  g'ufa)  {px  -  2  [pi  -  gu&Algii&i)]}- 

The  ratio,  —gl^x^lg^^Xj),  is  clearly  the  marginal  increase  in  input  i  for  a 
unit  increase  in  output;  hence  the  term  in  braces  is  the  difference  between 
price  and  marginal  cost.  Since  the  factor,  gu&j),  is  necessarily  positive,  the 
statement  in  the  text  is  always  equivalent  to  condition   (b). 
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literature  as  the  minimum  cost  of  producing  any  given  output, 
prices  being  taken  as  given.62)  Marginal  cost  is  then  simply  the 
derivative  of  the  cost  function  with  respect  to  output. 

The  usual  rule  of  equaling  marginal  cost  to  price,  then,  implies 
that  the  output  of  the  firm  is  produced  at  minimum  cost.  Despite 
the  reasonable  sound  of  this  statement,  it  is  not,  in  general,  a 
correct  rule  for  optimal  resource  allocation.  Consider  for  example 
a  firm  which  has  two  processes  with  the  same  output  but  dif- 
ferent inputs,  each  process  showing  increasing  returns.  Let 
Zife/i)  De  the  amount  of  input  1  required  to  produce  amount 
yx  of  the  output  by  process  1,  f2{y2)  the  amount  of  input  2  required 
to  produce  amount  y2  of  the  output  by  process  2.  Let  ^  and  £a 
be  the  amounts  of  inputs  1  and  2  initially  available.  We  assume 
increasing  returns,  so  that  f"(y^)  <  0,  f2{y2)  <  0.  The  rule  of 
minimizing  cost  at  any  given  output  requires  the  form  to  choose 
yx  and  y2  so  as  to  minimize  j>1f1{y1)  +  #2/2(2/2)  subject  to  the 
constraint,  y1-\-y2  =  y,  where  #x  and  #2  are  the  prices  of  inputs 
1  and  2  respectively.  This  is  equivalent  to  minimizing, 

wiVi)  =  #1/1(2/1)  +  #2/2(2/  -  Hi). 
with  respect  to  yx  subject  to  the  inequality  constraints  Of^y^y. 
By  differentiation,  ^"fe/i)  <  0  throughout  the  interval,  so  that 
the  minimum  must  occur  at  either  yt  =  0  or  y1  =  y,  i.e.,  y2  =  0. 
Thus,  no  matter  what  values  are  assigned  for  #x,  #2,  and  y,  one 
of  the  two  inequalities, 

filVi)  <  Si,  /2(2/2)  <  h> 
must  hold. 

On  the  other  hand  socially  optimal  allocation  requires  maxi- 
mizing   total    output    yx  +  y2    subject    to    the    constraints, 


62)  For  representative  texts,  see  G.  J.  Stigler,  The  Theory  of  Price,  (rev. 
ed.;  New  York:  Macmillan,  1952)  pp.  127 — 29;  K.  E.  Boulding,  Economic 
Analysis,  (3rd  ed.;  New  York:  Harpers,  1955)  ch.  34.  This  definition  of  cost 
is  clearly  the  correct  one  in  the  theory  of  the  firm,  whether  under  competition 
or  under  monopoly;  maximization  of  profit  requires  maximizing  the  difference 
between  revenue  and  cost  as  defined  in  the  text,  or,  equivalently,  the  equating 
of  marginal  revenue  and  marginal  cost. 

Writers  who  have  dealt  with  optimal  allocation  under  increasing  returns 
have  usually  not  been  careful  to  define  marginal  costs;  however,  Lange  ex- 
plicitly presents  the  definition  given  in  the  text  (pp.  116 — 17). 
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fiitfx)  tS=  Slt  j 2(2/2)  =  ^2:   It  is  obvious  that  the  maximum  is 
attained  by  choosing  yx  and  y2  so  that, 

fiiVi)  =  fi,  /a(ya)  =  ^2- 
Therefore  it  is  impossible  that  the  rule  of  cost  minimization  lead 
in  this  case  to  an  optimal  allocation  of  resources.  The  minimum 
cost  solution  is  not  socially  efficient  because  it  leaves  unemployed 
resources  which  have  positive  usefulness,  no  matter  how  factor 
prices  are  chosen. 

D.  Optimal  Allocation  Through  Imperfect  Competition. 

To  deal  with  the  drawbacks  of  Theorem  1  as  a  tool  for  deter- 
mining optimal  resource  allocation  under  increasing  returns,  we 
will  turn  to  Theorems  3  and  4  which  yield  local  saddle-point 
characterizations  of  the  optimal  allocation.  Unlike  Theorem  1, 
these  conditions  are  satisfied  only  by  local  maxima,  not  by 
other  extrema. 

First,  consider  the  application  of  Theorem  3  to  the  resource 
application  problem.  The  concavified  Lagrangian  with  trans- 
formed constraints  becomes, 

(75)  L*{yv  .  .  .,  yni  xv  .  .  .,  xm\  pv  .  .  .,  pjrj) 

=  Ufa  •  •  »yn)  +  XP*[i  -  (i  -  g«)1+n 

where  the  excess  supply  gi  =  gi(xv  .  .  .,  xm,  yv  .  .  .,  yn)  is  defined 
by  (46)  and  (47). 

The  optimal  allocation  is  characterized  as  a  local  saddle- 
point.  Since  L*  is  linear  in  the  p/s,  with  respect  to  which  there 
is  a  minimum,  we  have, 

(76)  1  —  (1  —  gi)1+1>*  ^  0;    for   any   i   for   which    the    strict 
inequality  holds,  pi  ==  0. 

Here,  gt  is  the  excess  supply  at  equilibrium,  that  is, 
gi(xv  .  .  .,  xm>  yv  .  .  .,  yn).  However,  it  is  easy  to  see  that  the 
expression, 

1  -  (1  -  gi)1+Vi> 

is  positive,  zero,  or  negative  according  as  gt  is  positive,  zero,  or 
negative;  indeed  it  was  originally  chosen  to  have  this  property 
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(see  remarks  preceding  (37)  in  part  II,  section  C).  Hence  (76) 
is  equivalent  to, 

(77)  gt  ^  0;  if  gt  >  0,  then  ft  =  0, 

as  was  to  be  expected. 

Let  us  combine  the  maximization  part  of  the  saddle-point 

criterion  with  (77). 

Theorem  14.  Under  certain  regularity  conditions,  a  neces- 
sary and  sufficient  condition  for  a  locally  optimal  allocation 
is  that, 

(a)  yv  .  .  .,  yn,xv  .  .  .,  £TO  maximize  the  expression,  U(yv...,yn) 
+  IfiO-  -  [1  ~  gt(*i,  •  •  -  »m,  2/i,  •  •  ..  yn)]1+Vi)> 

(b)  gi(xv  .  .  .,  xm,  yv  .  .  .,  y_n)  ^  0;   if  gi(xv  .  .  .,  xm, 

Vi>  •  -  •»  Vn)  >  .0*  then  ^^  =  0,  where  the  ^/s  have  been 
chosen  sufficiently  large. 

Condition  (a)  can  be  given  an  institutional  interpretation. 
Suppose  the  helmsman  takes  over  the  functions  of  the  process 
managers  in  addition  to  his  own,  so  that  he  chooses  the  #/s 
as  well  as  the  y/s.  At  the  same  time,  suppose  that  the  custodian 
for  commodity  i  purchases  any  amount  ut  available  for  an 
amount,  ft[l  —  (1  —  «<)*+**].  This  function  is  a  total  revenue 
function  in  the  ith  market.  The  helmsman  then  maximizes  the  sum 
of  his  utility  and  the  revenue  he  obtains  by  selling  the  excess 
supply  (the  "re venue' '  would,  of  course,  be  negative  if  there 
were  an  excess  demand).  The  helmsman  on  this  interpretation 
is  dealing  with  a  series  of  markets  on  which  he  is  an  monopolist 
or  monopsonist. 

In  this  formulation,  the  p/s  do  not  play  the  role  of  com- 
petitive prices,  but  they  are  parameters  which  affect  the  location, 
though  not  the  general  shape,  of  the  revenue  functions.  We  will 
therefore  refer  to  them  as  revenue  parameters.  Condition  (b) 
insures  that  at  equilibrium  the  revenue  parameters  are  chosen 
so  that  the  helmsman  will  have  zero  excess  supply  for  free  goods 
(when  pi  =  0,  the  revenue  is  identically  zero). 

So  far  the  institutional  interpretation  has  been  highly  central- 
ized. All  that  has  been  done  is  to  rephrase  the  constrained 
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maximization  problem  as  two  problems,  one  an  unconstrained 
maximization  and  one  a  feasibility  condition.  However,  any 
unconstrained  maximization  has  a  certain  element  of  decen- 
tralization implicit  in  it.  If  zv  .  .  .,  zv  maximize  a  function 
/(%,  .  .  .,  zv),  then  it  is  true  that  z1  maximizes  the  function 
f(zlt  z2,  .  .  .,  zp)  as  a  function  of  the  single  variable  zv  and 
similarly  with  the  other  z/s.  In  the  case  of  Theorem  9,  there  was 
an  extreme  simplification  in  the  unconstrained  maximization 
(given  the  pi's),  since  the  function  to  be  maximized  was  a  sum 
of  functions  each  involving  a  different  variable.  The  maximiza- 
tion in  Theorem  14  is  not  so  simple,  but  along  the  lines  just 
suggested  there  is  a  degree  of  decentralization  which,  as  we  shall 
see,  becomes  more  pronounced  in  the  dynamic  form. 
Let, 

(78)  hu  =  gi-  ga  (i  =  1, . . .,  s;  j  =  1, .  .  .,  m); 

(79)  *«=*«+'!*   (*  =  1,...,*). 

Reference  to  (46 — 47)  shows  that  hti  is  independent  of  xj}  though 
it  depends  on  the  scales  of  all  processes  other  than  the  jth  and, 
in  the  case  of  desired  commodities,  on  y{.  Similarly,  Ht  depends 
on  the  process  scales  but  is  independent  of  the  y/s.  If,  following 
the  argument  above,  we  maximize  with  respect  to  xit  taking 
the  y/s  and  all  process  scales  other  than  the  yth  as  given  at  their 
equilibrium  values,  we  have  that, 

s 

(80)  x§  maximizes  J M1  ~  ll  ~  K  ~  guM]1+nt} 

i=l 

the  term  U(yv  .  .  .,  yn)  being  omitted  since  it  is  a  constant  with 
respect  to  x^  Here  as  before  a  bar  over  a  symbol  denotes 
evaluation  at  equilibrium.  Similarly, 

(81)  yly...,yn  maximizes  0r^1,...;yJ-fi^4-[l-(l-^+^)1+^]. 

i=l 

Equations  (80)  and  (81)  suggest  a  degree  of  decentralization. 
Each  process  manager  chooses  an  xj  so  as  to  maximize  (80),  the 
helmsman  chooses  final  demands  so  as  to  maximize  (81).  Each 
is  now  operating  on  imperfectly  competitive  markets.  Each  now 
needs  somewhat  more  information  from  the  outside  than  in  the 
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concave  case.  The  manager  of  the  jth  process  needs  to  know 
not  only  the  p/s  but  also  the  ht/s  (i  =  1,  .  .  .,  s);  the  helmsman 
needs  to  know  both  ft  and  %i  (i  =  1,  .  .  .,  n). 

The  dynamic  counterpart  of  Theorem  14  is  derived  from 
Theorem  6.  If  we  apply  the  theorem  to  the  resource  allocation, 
we  find  by  straightforward  differentiation  that, 

(82)         i,*  =  ^-a  +  ^a-gi)". 

(83)  £*  =  2(1  + %)&(!-&)  i'gU- 

The  gradient  method  based  on  these  derivatives  requires  the 
helmsman  and  the  process  managers  to  find  the  unconstrained 
maxima  of  (81)  and  (80)  respectively,  with  equilibrium  replaced 
by  current  values.  The  information  required  is  greater  than  in 
the  concave  case  of  Theorems  10  and  11,  but  not  too  much 
greater.  The  helmsman  and  each  process  manager  must  now 
know  the  current  values  of  both  the  revenue  parameter  and  the 
excess  supply  for  every  market,  in  addition  to  his  utility  function 
or  technology,  respectively. 

The  behavior  of  the  revenue  parameters  is  a  straightforward 
dynamic  analogue  of  (76). 

Theorem  15.  Under  certain  regularity  conditions  and  for 

rji's  sufficiently  large,  the  system  of  differential  equations, 


[a)  dyjdt  = 


0  if  Z*  <  0  and  yt  =  0, 


[L*  otherwise, 


(b)^  =  («ifil<land^0' 


L*  otherwise; 

ro  if  [1  -  gi(xv  .  .  .,  xmi  ylf  .  .  .,  yu)Y+«<l 
(c)  dpjdt=    and  pi  =  0, 

It1—  iifci,  •  •  -,  xm,  ylt  .  .  .,  yn)]1+Vi-l  otherwise; 

converges  to  a  local  optimum  of  allocation.  Here  L*  and  L* 
are  defined  by  (82)  and  (83),  respectively. 

Theorem  15  thus  supplies  a  dynamic  system  which  converges 
locally  to  an  optimum  and  which  makes  very  limited  infor- 
mational demands  upon  its  participants. 

Theorem  4  leads  to  a  static  characterization  of  an  optimal 


DECENTRALIZATION  AND  COMPUTATION  IN  RESOURCE  ALLOCATION      99 

allocation  similar  to  that  of  Theorem  14.  The  new  feature  is  the 
existence  of  a  set  of  variables,  the  w/s,  which,  in  equilibrium, 
represent  the  permissible  excess  supply.  If  we  refer  back  to  the 
reasoning  leading  to  Theorem  4,  we  see  that  the  w/s  were 
introduced  to  convert  inequality  constraints  into  equality 
constraints.  At  equilibrium,  then  equality  must  hold  in  the 
constraints  gi(xlt  .  .  .,  xm,  yv  .  .  .,  yn)  —  wt  =  0. 

(84)  gt  =  wt\  if  gt  >  0,  then  ft  =  0. 

The  second  part  of  (84)  is  intuitively  obvious  and  can  be  proved 
rigorously. 

In  the  saddle-point  characterization  supplied  by  Theorem  4, 
the  concavified  Lagrangian  with  quadratic  modification  has  as 
maximizing  variables  the  w/s  as  well  as  the  x/s  and  the  y/s. 
However,  if  we  follow  the  argument  given  above  about  the 
decentralization  implicit  in  any  maximization  we  may  say  that 
the  helmsman  and  the  process  managers  maximize  the  La- 
grangian with  respect  to  the  variables  under  their  control, 
taking  as  given  both  the  p/s  and  the  w/s.  We  may  refer  to  the 
w/s  as  the  disposal  parameters.  Then,  given  both  the  revenue 
and  the  disposal  parameters,  the  helmsman  and  the  process 
managers  have,  just  as  in  Theorem  14,  to  maximize  a  nonlinear 
function  of  the  inputs  and  outputs.  Again  they  can  be  thought 
of  as  seeking  to  maximize  profits  and  utilities  by  activities  which 
include  trading  on  imperfect  markets. 

Theorem  16.  Under  certain  regularity  conditions  and  for  X 

sufficiently  large,  a  necessary  and  sufficient  condition  for  a 

locally  optimal  allocation  is  that, 

(a)  yv  .  .  .,  yn,  xv  .  .  .,  xm  maximize  the  expression, 

Ufa,  •  • .,  yn)  +  i  fate,  -  «*)  -  Hgi  -  «,)•]; 

t=l 

(b)  f<  =  wi  ^  0;  if  wt  >  0,  then  pt  =  0. 

The  dynamic  counterpart  of  Theorem  16  derives  from  ap- 
plication of  Theorem  6  to  the  resource  allocation  problem.  The 
differentiations  involved  are  straightforward. 

Theorem  17.  Under  certain  regularity  conditions  and  for  X. 

sufficiently  large,  the  system  of  differential  equations, 
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(*\   dv  Idt  -  1°  if  L*<  <  °  and  Vi  =  °' 

(a)  dyjdt  -  |^  otherwise> 

(b)  dx  idt  =  1°  if  L*'  <  °  and  Xj  =  °' 

5  \lJ   otherwise, 


(c)  dwjdt 


i 

0  if  pt  —  2Afc  —  w>«)  >  0  and  wi  =  0, 
— <pi  +  2A(gi  —  wt)  otherwise, 


(d)^/^  =  (oif,^>!''aild^  =  0* 

v  [Wi  —  g^  otherwise, 

converges  to  a  local  optimum  of  allocation.  In  (a)  and  (b), 
respectively, 

(e)  LJ,  =  ^  -  fc  +  2Afc  -  wt), 

(f)  4,,  =  2  tfc  -  2Afe«  -  ^)]g;,. 

1=1 

The  adjustment  formulas  for  the  individual  participants  are 
comparatively  simple.  The  helmsman  and  each  process  manager 
have  to  know,  in  addition  to  their  own  utility  function  or 
technology,  the  revenue  parameter,  pit  the  disposal  parameter, 
wit  and  the  excess  supply,  gif  for  each  commodity.  The  custodian 
for  commodity  i  has  to  know  g{  and  he  can  then  determine 
both  the  revenue  and  the  disposal  parameters  from  the  pair  of 
differential  equations  (c)  and  (d). 

E.  Optimal  Allocation  Through  Nonlinear  Price  Adjustment. 

If  we  examine  (82)  and  (83),  we  see  that  there  is  a  common 
expression  in  all  of  them,  for  which  we  may  introduce  a  symbol, 

(85)  <?.-=(! +%)&(! -?*)"• 

If  we  now  refer  to  (a)  and  (b)  of  Theorem  15,  we  see  that  the 
helmsman  or  the  process  managers  can  respectively  be  thought 
of  as  seeking  to  maximize  the  difference  between  utility  and 
costs  or  profits,  if  the  q/s  are  now  regarded  as  prices.  These 
prices  are  now  determined  by  a  fairly  complicated  adjustment 
pattern;  first  an  auxiliary  variable  pt  is  determined  by  equation 
(c)  of  Theorem  15  and  then  qt  is  calculated  from  (85).  However, 
the  informational  requirements  are  low,  since  the  custodian,  who 
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is  charged  with  these  operations,  need  still  know  at  any  one 
time  only  the  excess  supply  g^. 

Theorem  18.  Under  certain  regularity  conditions  and  for 
tj/s  sufficiently  large,  the  dynamic  system, 

(h\  J    IJt  —  /°  ^  dnjdx,  <  0  and  xi  =  0, 
*  '       j      ~~  \d7CjjdXj  otherwise, 

(c)   ?,=  (1  +  yi)fii(l  ~  gi)Vi, 

0  if  gi  >  0  and  pt  =  0, 


(d)  dfijdt  =  l(1  _  ft)1+f<  _  x  otherwise^ 
converges  to  a  local  optimum  of  allocation.  In  (b), 

s 

(e)  £;(»,)  =2qign{xi). 

i  =  l 

In  Theorem  18,  the  informational  requirements  on  each 
participant  are  no  different  than  in  the  concave  case  of  Theorem 
10.  The  only  elements  of  centralization  are  that  a  prior  decision 
must  be  made  on  the  constants  ^  and  that  a  global  optimum 
can  only  be  assured  if  the  initial  approximation  is  sufficiently 
close. 

A  limiting  case  of  some  interest  arises  if  the  adjustment  speed 
of  the  helmsman  and  the  process  managers  are  increased  in- 
definitely. In  that  case,  for  the  helmsman  we  must  have  Uy  =  qt 
unless  yi  =  0  and  similarly  dnj\dxi  =  0  unless  xj  =  0.63)  Then 
we  return  to  the  supply-and-demand  model  briefly  discussed  in 
section  B.  But  the  price-adjustment  rules  of  (c)  and  (d)  of 
Theorem  18  now  insure  stability. 

Theorem  19.  Theorem  18  remains  valid  if  (a)  and  (b),  are 

replaced  by, 

63)  It  should  be  made  clear  that  in  this  process  xs  is  not  necessarily  chosen 
to  maximize  process  profits,  taking  the  q/s  as  given  prices.  The  chosen  x$  does 
maximize  the  expression  (80),  with  pt  and  h^  being  replaced  by  their  current 
values,  pi  and  At7;  with  the  notation  of  (85),  the  statement  in  the  text  follows. 
Thus,  as  Lange  correctly  noted,  the  rule  that  marginal  profitability  be  zero 
replaces  the  rule  of  profit  maximization  for  increasing  returns;  however,  the 
above  analysis  shows  that  the  price-adjustment  rule  which  leads  to  a  stable 
allocative  system  is  not  a  simple  one. 
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(a')  UVi  ^  qi;  if  UVi  <  qt,  then  Vi  =  0; 

(b')  dizJdXj  5^  0;  if  dni\dxj  <  0,  then  xs  =  0. 

F.  Optimal  Allocation  Through  Price  Speculation. 

An  alternative  institutional  interpretation  can  be  given  of 
Theorem  17,  in  a  manner  somewhat  similar  to  that  which  led 
from  Theorem  15  to  Theorem  18.  If  we  look  at  (e)  and  (f)  of 
Theorem  17,  we  find  it  natural  to  introduce  the  definition, 

(86)  fi  =  Pi  +  U(Wi-gi). 

If  we  now  take  account  of  (d)  of  Theorem  17,  we  see  that,  unless 
both  gt  >  wi  and  p{  =  0, 

(87)  p'i^Pt  +  Uidpjdt), 

that  is,  p\  can  be  considered  as  an  expectation  of  the  price  of 
commodity  i  in  the  near  future  formed  by  extrapolation  of 
current  rates  of  change.64)  We  will  now  argue  that  (87)  can  be 
made  to  hold  without  the  exception  noted  above.  As  noted  in 
the  Remark  following  Theorem  4,  the  saddle-point  properties 
of  the  concavified  Lagrangian  with  quadratic  modification  hold 
whether  the  p/s  are  considered  to  be  non-negative  or  unrestricted 
as  to  sign.  Relation  (d)  of  Theorem  17  is  the  appropriate  form 
if  the  p/s  are  restricted  to  be  non-negative.  If  they  are  taken 
to  be  unrestricted  as  to  sign,  then  only  the  second  line  is  relevant; 
Theorem  17  remains  true  with  this  alteration.  Then  (87)  is  true 
in  general. 

Theorem  20.  Under  certain  regularity  conditions  and  for  X 
sufficiently  large,  the  system  of  differential  equations, 

\UVi  —  pi  otherwise, 

64)  Extrapolative  expectations  of  the  type  of  equation  (87)  have  been  studied 
in  connection  with  inventories  by  L.  Metzler,  "The  Nature  and  Stability  of 
Inventory  Cycles,"  Review  of  Economic  Statistics,  XXIII  (1941),  113 — 29, 
and  in  connection  with  inflationary  price  movements  by  A.  C.  Enthoven,  "Mone- 
tary Disequilibria  and  the  Dynamics  of  Inflation,"  Economic  Journal,  Vol.  66 
(1956),  256 — 70;  see  also,  A.  C.  Enthoven  and  K.  Arrow,  "A  Theorem  on  Ex- 
pectations and  the  Stability  of  Equilibrium,"  Econometrica,  Vol.  24  (1956) 
288 — 293.  More  detailed  study  of  the  system  studied  in  this  section  will  be 
found  in  Arrow  and  Solow,  op.  cit.,  sec.  4. 
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/u\  j    ijt  _  P-  ^  dn^jdXj  <  0  and  xj  =  0, 

*  '       jl     ~  [djz^/dXj  otherwise, 

(c)  ^  Aft  -  (°  if  ^  >  °  and  ^'  =  °' 

*  '        *'  {—ft*  otherwise, 

(d)  dftjdt  =  w<  -  &, 

(e)  #  =  pt  +  2A(#,/^), 

converges  to  a  local  optimum  of  allocation.  In  (b), 

(f)  ^(x^it'tgvk)- 

Relations  (a)  and  (b)  show  that  the  helmsman  and  the  process 
managers  are  moving  in  the  direction  of  maximum  benefits 
computed  at  expected  prices,  rather  than  current  ones.  Equa- 
tions (c — e)  can  be  given  various  institutional  interpretations; 
one  is  that,  taken  together,  they  instruct  the  custodian  how  to 
determine  the  expected  price  to  be  announced  to  the  other 
participants.  It  is  required  that  the  expected  price  be  sufficiently 
sensitive  to  the  current  rate  of  change.65) 

G.  The  Linear  Case. 

We  return  finally  to  the  case  which  started  the  entire  in- 
vestigation, that  where  the  utility  function  and  all  the  processes 
are  linear,  that  is,  the  case  of  linear  programming.  In  this  case, 
as  we  have  seen  in  part  II,  section  D,  the  gradient  method  will 
normally  lead  to  endless  oscillations.  Since  stability  has  been 
shown  to  be  related  to  the  strict  concavity  of  the  Lagrangian, 
it  is  natural  to  try  one  of  the  concavification  methods  introduced 
in  part  II,  section  C.  Since  the  problem  is  linear,  the  Lagrangian 
is,  so  to  speak,  on  the  borderline  of  strict  concavity,  and  it 
might  be  expected  that  any  modification  of  the  Lagrangian 
which  increases  its  concavity  would  suffice.  This  expectation 
is  correct,  at  least  as  far  as  the  method  of  concavification  by 
transformed  constraints. 

65)  The  system  of  Theorem  20  and  its  interpretation  in  terms  of  price  ex- 
pectations is  similar  but  not  identical  to  the  proposal  of  T.  Kose,  "Solutions 
of  Saddle  Value  Problems  by  Differential  Equations,"  Econometrica,  Vol.  24 
(1956),  59—70. 
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Theorem  21.  If  U(ylt  .  .  .,  yn)  and  the  process  functions 
gij{Xj)  are  all  linear,  then  the  system  of  differential  equations 
of  Theorem  15,  with  any  77 z-  >  0  converges  to  a  global  optimum 
of  allocation.  Equivalently,  the  dynamic  systems  of  Theorems 
18  and  19  converge  to  a  global  optimum  under  the  same 
condition.66) 

The  method  of  Theorem  21  is  by  no  means  the  only  gradient 
method  for  the  solution  of  linear  programming  problems.  Because 
of  their  equivalence  to  zero-sum  two-person  games,67)  the 
modified  gradient  methods  developed  by  Brown  and  von  Neu- 
man  for  solving  the  latter  can  be  applied  to  the  former.68)  Of 
course,  there  are  other  methods,  of  which  the  best  known  is  the 
simplex  method,  which  are  essentially  different  from  the  gradient 
methods.  Although  there  has  been  as  yet  insufficient  com- 
putational experience,  it  is  very  likely  that  the  simplex  method 
is  a  superior  method  for  computation  to  any  variety  of  the 
gradient  method.  But  from  the  viewpoint  of  the  present  study, 
it  lacks  the  very  important  virtue  of  decentralization. 


66 )  For  a  proof  of  Theorem  21,  see  Arrow  and  Hurwicz,  "Gradient  Method 
for  Resource  Allocation,"  op.  cit.,  Theorem  3.  An  illustration  of  this  method 
with  remarks  on  the  problems  of  machine  calculation  is  found  in  T.  Marschak, 
"An  Example  of  a  Modified  Gradient  Method  for  Linear  Programming,"  ch.  9 
in  Arrow,  Hurwicz,  and  Uzawa,  op.  cit. 

67)  See  footnote  37. 

68)  See  G.  W.  Brown  and  J.  von  Neumann,  "Solutions  of  Games  by  Dif- 
ferential Equations,"  in  H.  W.  Kuhn  and  A.  W.  Tucker,  eds.,  Contributions 
to  the  Theory  of  Games  I,  Annals  of  Mathematics  Study  No.  29  (Princeton, 
New  Jersey:  Princeton  University  Press,  1950),  pp.  73 — 80.  For  the  application 
to  linear  programming,  see  M.  Fukuoka,  "A  note  on  Convergence  in  Linear 
Programming  Problems,"  Cowles  Commission  Discussion  Paper:  Economics 
No.  2108,  July  14,   1954. 


PART  II 

On  the  Theory  of  Utility  and  Demand 


GERHARD  TINTNER 


External  Economies  in  Consumption1. 


The  importance  of  external  economies  in  production  is  well 
recognized.  Recent  literature  has  treated  the  theory  of  external 
economies  in  consumption.2)  This  may  have  some  importance 
for  welfare  economics.3)  It  is  one  way  of  getting  away  from  the 
unrealistic  assumption  of  a  society  consisting  of  atomistic  in- 
dividuals, who  do  not  influence  each  other.  Such  a  procedure 
has  the  advantage  of  simplicity,  but  as  Aristotle  remarked,  man 
is  a  political  animal  (Politics,  1.29  1252b). 

We  assume  a  society  consisting  of  K  individuals.  Let  u{k) 
be  the  utility  function  or  utility  index  of  the  individual  k 
(k  =  1,  2  .  .  .  K).  Let  xkn  be  the  amount  of  the  commodity  or 
service  n  consumed  by  the  individual  h  (n  =  1,  2  .  .  .  N).  We 
assume  that  the  money  income  Mk  of  the  individual  is  given,  as 
well  as  the  prices  ftn  (n  =  I,  2  .  .  .  N)  of  all  commodities  and 
services.  We  denote  by: 

(1)  *»  =  i*»,     (n=l,2...N) 

k=l 

the  total  amount  of  commodity  or  service  n  demanded.  We 
neglect  the  possibility  of  coalitions  and  other  complications 
arising  in  the  theory  of  games. 

x)  I  am  much  obliged  to  Dr.  R.  L.  Basman  (Hanford,  Wash.)  and  to  M. 
Wimbee  (Paris,  France)  for  advice  and  criticism  of  this  paper.  Journal  Paper 
No.  J-3766  of  the  Iowa  Agricultural  and  Home  Economics  Experiment  Station, 
Ames  Iowa.  Project  No.  1200. 

2)  J.  S.  Duesenberry,  Income,  Savings  and  the  Theory  of  Consumer' s  Behavior. 
(Cambridge,  Mass.:  Harvard  University  Press,  1949),  H.  G.  Johnson,  "The 
Effects  of  Income  Redistribution  on  Aggregate  Consumption  with  Interdepend- 
ence of  Consumer's  Preferences,"  Economica,  vol.  19  (1952),  131  ff.  J.  S. 
Prais  and  H.  S.  Houttakker,  The  Analysis  of  Family  Budgets  (Cambridge: 
Cambridge  University  Press,   1955). 

3)  G.  Tintner,  "A  Note  on  Welfare  Economics,"  Econometrica,  vol.  14  (1946), 
69  ff. 
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Following  Allen,4)  we  denote  by: 

(2)  EfjEz  =  (dfldz)(zjf) 

the  elasticity  of  a  function  /  with  respect  to  a  variable  z. 
We  present  first  the  classical  atomic  theory.5)  Let 

(3)  u^  =  u™{xkl,  xk2>.  ..xkN) 

be  the  utility  function  of  the  individual  k,  which  depends  only 
upon  the  quantities  of  commodities  and  services  possessed  by 
himself.  The  function  (3)  has  to  be  maximized  under  the  budget 
restriction: 

n=l 

Now  we  define: 

(5)  du^/dx,,  =  <;  d*u<*>idxkrdxks  =  •$„ 

and  the  matrix  with  N  +  1  rows  and  columns : 

o   «£> 


(6)  £/<*>  = 


Mks     ^kr,  ks- 


(r,  s  =  l,2...N). 


Let  UkkJ  be  the  cofactor  of  u[kJ  in  the  inverse  matrix  U^'1 
and  U(kJ}  ks  the  cofactor  of  uk®  ks  in  the  same  matrix.  Then  we 
have  the  well  known  results: 

(7)  dxJdMk  =  LkU$ 

(8)  dxjdpr=  -LkUkkJxkr+LkUi»  ks=  -xkr(dxkJdMk)  +LkUkr_ks. 

In  these  formulae  Lk  is  a  Lagrange  multiplier,  the  marginal 
utility  of  money  for  individual  k.  We  have  for  superior  com- 
modities: 

(10)  dxJdMk  >  0 
and  for  inferior  commodities: 

(11)  dxkJdMk  <  0. 
LkUkk})ks  is  the  substitution  term.  If 

(12)  tffiU<o 

4)  R.  G.  D.  Allen,  Mathematical  Analysis  for  Economists  (London:  Macmillan, 
1949),  pp.  251  ff. 

6)  J.  R.  Hicks,  Value  and  Capital  (2nd  ed.,  Oxford:  Clarendon  Press,  1956). 
P.  A.  Samuelson,  Foundations  of  Economic  Analysis,  (Cambridge,  Mass.: 
Harvard  University  Press,  1947,  H.  Hotelling,  "Demand  Functions  with 
Limited  Budgets,"  Econometrica,  vol.  3   (1935)   66  ff. 
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then  the  commodities  r  and  s  are  substitutes.  If 

(13)  ffi!U>0 

then  r  and  s  are  complementary  commodities  or  services. 

We  see  from  formula  (8)  that  we  may  have  dxks/dps  >  0,  in 
spite  of  the  fact  that  C7gJ  k8  <  0.  This  is  the  Giffen  phenomenon 
which  occurs  for  inferior  goods. 

Now  we  define: 

(14)  mkr  =  prxJMk 

the  proportion  of  the  expenditure  of  individual  k  on  commodity 
r  in  relation  to  his  total  expenditure  or  income.  Also  the  partial 
elasticity  of  substitution: 

(15)  ag>  „  =  (2  **„<>)K£U,/(**,  •  »*) 

The  elasticity  with  respect  to  price  is  now  given  by: 

(16)  ExJEpr  ==  mkr(akr>ks  -  ExJEMk). 

For  the  total  demand  in  the  market  for  commodity  s  we  have: 

(17)  .   3XJdMk  =  dxJdM> 

(18)  dXJdfi,  =  I  dxjdp, 

k=l 

and  for  the  elasticities: 

(19)  EXJEMk  =  ExJEMk 

(20)  EXJEf,  =  |  (ExJEfir)  (xJX.). 

The  price  elasticity  of  demand  for  the  total  demand  in  the 
market  is  the  weighted  sum  of  the  individual  demand  elasticities, 
the  weights  are  the  proportions  xks/xs  of  the  quantity  demanded 
by  individual  k  in  relation  to  the  total  demand  for  the  com- 
modity in  question. 

Now  we  assume  that  the  utility  index  depends  on  all  the 
incomes  in  the  society: 

(21)  »(*)  =  vW(xkl,  xk2...  xkN;  Mlf  M2  .  .  .  MK) 

This  is  perhaps  a  somewhat  realistic  assumption,  since  the  tastes 
of  individuals  depend  upon  the  social  class  of  the  individuals 
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and  social  standing  is  to  some  extent  depending  upon  income, 
at  least  in  America.6) 
We  define: 

(22)  BvW/dMt  =  f>g>. 
Now  we  have: 

(23)  dxJdM,  =  LkU[»  ,  6kt  -  |  Ufl hnvh%. 

n=l 

In  this  formula  dkt  denotes  the  Kronecker  delta  dkt  =  1  if  h  ==  t% 
dkt  =  0  otherwise. 

The  income  elasticity  appears  now  as: 

(24)  Ex  JEM,  =  dkt(ExJEMk)0  (Mt/Mk) 

-i(Ev^lEMt)min^]ktt. 

n=l 

In  this  formula  (ExkJEMk)0  is  the  income  elasticity  as  defined 
in  formula  (11)  for  the  atomic  case.  Let  us  put  t  =  k,  i.e.  we 
consider  the  elasticity  with  respect  to  the  income  of  the  individual 
k  in  question: 

(25)  Ex  J  EM  k  =  (ExJEMk)0- 1  (Evg/EMJo™  kn. 

We  see  from  this  formula,  that  income  elasticities  and  elasticities 
of  substitution  are  no  longer  independent.  Because  of  the 
dependence  of  the  utility  function  on  all  incomes,  we  may  have 
in  (25)  ExkJEMk  <  0  even  if  the  atomistic  income  elasticity 
(ExksIEMk)0  is  positive,  i.e.  we  have  a  superior  good  in  the 
classical  sense.  This  is  due  to  the  sum  in  formula  (25),  which 
may  be  large  and  positive. 

Similarly  we  have  for  the  total  demand: 

(26)  EXJEM,  =  (ExJEMt)0-2  f  (JEvfJEM t)o™kn. 

k=l  n=l 

Now  we  consider  the  most  general  case  in  which  the  utility 
of  each  individual  depends  upon  the  consumption  of  all  individuals 
in  the  given  society.  We  may  think  of  the  Veblen  effect  and 
similar  phenomena.7) 

6)  T.  Parsons  and  N.  J.  Smelser,  Economy  and  Society  (London:  Routledge,. 
1956). 

7)  T.  Veblen,  Theory  of  the  Leisure  Class  (New  York:  Modern  Library,  1945).. 
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(27)  w^  =  w{h)(xllf  x12  .  .  .  xlN>  x21> 

x22,  .  .  .  xm  .  .  .  xK1,  xK2  .  .  .  xKN). 

We  define  a  compound  matrix,  i.e.  a  matrix  consisting  of 
matrices: 

(28)  W=[A^l     {i,j=l,2...K)t 
where  the  individual  matrices  are  defined  as  follows: 

[0      d    w(i)~~\ 

The  inverse  of  the  matrix  W  (28)  is  again  a  compound  matrix: 

(30)  W-1  =  [£<*>]. 
Now  we  obtain: 

(31)  dxJBMt  =  B»»  ■  Lt. 
The  income  elasticity  is: 

(32)  Ex  JEM  t  =  Bff>  ■  Lt  ■  Mtjxks. 
We  obtain  also: 

(33)  dxjdp,  =  - 1  xit  B<?>  Lt  +  |  L(  B«% 
and  for  the  price  elasticity: 

(34)  ExJEfi,  =  J,-mir(ExJEMl)+miro™i,  ■  (xjxj. 

t=l 

In  formula  (34)  we  denote  by  o$u  the  partial  elasticity  of 
individual  k  for  the  substitution  of  commodity  r  for  commodity  s 
by  the  individual  i. 

For  the  market  demand,  we  obtain  the  income  elasticity: 

(35)  EX JEM \  =  |  (Ex  JEM t) 
and  for  the  price  elasticity: 

(36)  EXJEp,  =  ||  -m^ExJEM^+m^,.  (xjxj. 
The  methods  used  by  Wald  to  derive  empirical  indifference 
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systems8)  and  the  recent  studies  based  upon  household  budget 
surveys9)  give  us  hope  that  an  empirical,  econometric  verifica- 
tion of  the  theory  may  be  possible. 


8)  A.  Wald,  "The  Approximate  Determination  of  Indifference  Systems  by 
Means  of  Engel  Curves,"  Econometrica,  vol.  8  (1940)  144  ff.  See  also  the  results 
of  J.  A.  Nordin  in  G.  Tintner,  Econometrics  (New  York:  Wiley,  1952),  pp.  60  ff. 

9)  R.  Stone,  D.  A.  Rowe,  W.  J.  Corlett,  R.  Hurstfield,  and  M.  Potter,  The 
Measurement  of  Consumer' s  Expenditure  and  Behaviour  in  the  United  Kingdom, 
1920—38  (Cambridge:  Cambridge  University  Press,  1954)  H.  Wold  and  L. 
Jureen,  Demand  Analysis  (New  York:  Wiley,  1953).  J.  S.  Prais  and  H.  S.  Hout- 
hakker,  op.  cit. 


RALPH  W.  PFOUTS 


Hours  of  Work,  Savings  and  the  Utility 
Function 


The  contributions  of  Harold  Hotelling  to  the  theory  of 
utility  and  demand  place  him  in  a  small  group  of  pioneers  that 
altered  the  fundamentals  of  economic  theory.  His  work,  coming 
at  about  the  same  time  as  that  of  Hicks,  Allen  and  Schultz, 
and  coinciding  approximately  with  the  discovery  of  Slutsky's 
work,  helped  to  lay  to  rest  older  concepts  of  measurable  utility, 
to  unite  utility  and  demand  theory  and  to  rid  the  theory  of 
unnecessary  ceteris  paribus  assumptions.  Through  these  advances 
a  coherent  theory  of  utility  and  demand  was  provided  for  the 
segments  of  economics  that  must  rely  on  utility  and  demand 
theory. 

The  purpose  of  this  paper  is  to  present  a  modest  extension 
of  this  utility  and  demand  theory  that  has  become  the  present- 
day  orthodoxy.  Specifically,  the  suggestion  will  be  made  that 
hours  of  work  and  savings  are  variables  affecting  psychological 
values  and  are  deserving  of  consideration  to  the  same  extent 
as  the  quantities  of  the  various  goods  consumed. 

I 

The  Assumptions 

If  hours  of  work  and  savings  are  inserted  into  the  utility 
function,  several  additional  results  are  achieved  that  cannot 
be  deduced  in  the  case  in  which  only  the  quantities  of  the 
goods  appear  as  independent  variables.  In  the  case  that  in- 
cludes hours  of  work  and  savings  in  the  utility  function,  a  more 
elaborate  and  more  realistic  prototype  of  the  consumer  is 
available,  and  monetary  considerations  can  be  brought  into 
the  pattern  of  consumer  behavior.  This  suggests  that  the  more 
elaborate   case   may   yield   a   consumer   element    suitable    for 
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aggregation  into  marginal  propensity  to  save  or  marginal  pro- 
pensity to  consumer  schedules,  thus  bringing  micro-theories 
and  macro-theories  into  closer  alignment.  Furthermore,  new 
demand  restrictions,  in  addition  to  modification  of  known 
restrictions,  are  obtained  from  this  formulation. 

In  this  paper  we  accept  without  extensive  justification  the 
desirability  of  including  hours  of  work  and  savings  in  the  utility 
function.  In  any  case,  this  seems  unnecessary  because  discussions 
of  demand  for  leisure  can  be  found  in  economic  literature  dating 
from  the  time  of  the  classical  economists.1)  It  would  seem  clear 
and  acceptable  that  hours  of  work  affect  both  the  individual's 
level  of  satisfaction  and  his  income. 

Similarly  there  seems  to  be  little  need  for  justifying  the 
inclusion  of  savings  in  the  utility  function.  If  savings  were  not 
actively  desired  by  individuals,  persons  in  the  lower  and  middle 
income  groups  would  spend  their  money  on  goods  and  services 
entirely,  and  savings  would  be  accumulated  only  by  those  with 
such  large  incomes  that  they  could  not  spend  the  entire  amounts 
on  consumers'  goods  and  services.  As  Lord  Keynes  put  it,  an 
act  of  saving  by  an  individual  "means  —  so  to  speak  —  a 
decision  not  to  have  dinner  today."2) 

Thus  we  write  the  utility  function  as 

(1)  U  =  0(xlt  .  .  .,  xni  y,  s), 

where  the  x's  indicate  quantities  of  goods  and  services,  y 
represents  hours  of  work  and  s  stands  for  current  savings  in 
dollars.  We  assume  that 


x)  Among  the  more  noted  and  more  recent  writings  on  this  subject  must  be 
listed:  A.  C.  Pigou,  The  Economics  of  Welfare  (London,  1929),  p.  573,  and 
A  Study  in  Public  Finance,  (London,  1928),  pp.  83 — 84.  F.  H.  Knight,  Risk, 
Uncertainty  and  Profit  (New  York  and  Boston,  1921),  p.  117.  L.  Robbins, 
"On  the  Elasticity  of  Demand  for  Income  in  Terms  of  Effort,"  Economica,  X 
(1930),  123 — 29,  reprinted  in  A.  E.  A.  Readings  in  the  Theory  of  Income  Distribu- 
tion, W.  Fellner  and  B.  F.  Haley,  eds.  (Philadelphia  and  Toronto,  1946),  pp. 
237 — 44.  F.  W.  Paish,  "Economic  Incentive  in  Wartime,"  Economica,  N.  S. 
VIII  (1941),  239 — 48.  G.  Cooper,  "Taxation  and  Incentive  in  Mobilization," 
Quarterly  Journal  of  Economics,  LXVI   (1952),  43 — 66. 

2)  Keynes,  The  General  Theory  of  Employment,  Interest  and  Money  (London 
and  New  York,  1936),  pp.  19,  21,  64—65,  210—13. 
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In  other  words  goods  and  savings  have  utility;  work,  at  least 
after  a  certain  time,  has  disutility. 

Since  current  savings  are  introduced  into  the  utility  function, 
a  stock  of  earning  assets  must  be  considered  in  the  budget 
equation  as  a  source  of  income.  In  addition,  since  hours  of  work 
are  included  in  the  utility  function,  it  is  appropriate  to  include 
the  wage  rate  in  the  budget  equation.  As  a  consequence  the 
budget  equation  is 

(2)  2pixi  +  s  =  wy  +  rM. 

i 

In  this  equation  the  ft's  are  the  prices  of  the  goods  and  services, 
w  is  the  wage  rate,  M  is  the  stock  of  earning  assets  and  r  is  the 
rate  of  return.  The  ft's,  w  and  r  are  assumed  to  be  mutually  in- 
dependent variables.  Clearly  M  includes  the  accumulation  of 
savings  from  past  periods,,  but  it  may  also  include  windfalls.3) 
There  are  advantages  to  be  obtained  from  writing  the  utility 
equation  in  the  form  of  (1)  and  the  budget  equation  in  the  form 
of  (2)  that  we  will  note  only  briefly.  In  the  first  place  these 
equations  specifically  include  hours  of  work  and  consequently 
should  be  readily  usable  in  welfare  economics.  That  is,  an  in- 
dividual's economic  welfare  depends  not  only  the  income  he 
receives,  or  the  goods  and  services  he  purchases,  but  also  on  the 
hours  of  work  he  must  render  to  obtain  the  income,  or  the 
goods  and  services.  Through  introducing  hours  of  work  directly 
into  the  equations  that  determine  the  consumer's  equilibrium 

3)  By  using  rM  in  (2)  we  implicitly  assume  either  that  the  individual  puts 
all  his  earning  assets  in  uses  showing  the  same  rate  of  return  or  that  r  is  an 
average  rate  of  return.  It  would  be  more  realistic  to  write  S^^»  in  place  of  rM 
to  show  a  differentiated  portfolio.  This  would  get  into  complications  that  it 
seems  desirable  to  avoid  for  purposes  of  the  present  paper  i.e.,  we  avoid  all 
problems  of  expectations,  principles  of  portfolio  selection,   etc. 
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we  introduce  directly  an  important  component  of  welfare  that 
previously  was  ignored  or  introduced  by  indirection. 

The  usual  budget  equation  can  be  modified  to  include  hours 
of  work  by  treating  work  as  a  negative  good.  The  budget  restric- 
tion then  is 

i 

The  disadvantage  of  this  equation,  beside  appearing  unnatural, 
is  that  there  is  no  fixed  income  exhibited  as  in  the  common  form 
of  the  budget  equation  and  there  is  no  term  corresponding  to 
rM  in  (2).  As  a  result  the  income  effect  cannot  be  readily  identi- 
fied if  resort  is  made  the  treating  work  as  a  negative  good.  Hence 
the  advantage  of  introducing  hours  of  work  is  offset  by  obscuring 
the  income  effect.  This  difficulty  does  not  arise  in  the  formulation 
advanced  below. 

The  Equilibrium 

Then  to  find  the  consumer's  equilibrium  position  under  the 
assumptions  outlined  above,  we  form  the  equation 

(3)  A  =  0{xit  .  .  .,  xn,  y,  s)  +  A(2><s,  +  s  -  wy  -  rM). 

i 

This  yields  equations  that,  together  with  (2),  describe  the 
maximum  satisfaction  position  for  the  consumer: 

—  =  04  +  Xpt  =  0,  (i  =  1, .  , ,,  *). 

dA 

(4)  —  =  0y  -  Xw  =  0, 
oy 

—  =  0S  +  X  =  0. 

OS 

Equations  (4)  and  equation  (2)  are  clearly  analogous  to  the 
orthodox  method  of  determining  the  consumer's  equihbrium 
position. 

II 

The  Effects  of  Price  Changes 
As  would  be  expected  from  recalling  utility  theory  that  does 
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not  include  hours  of  work  and  savings,  the  most  interesting 
points  arise  from  consideration  of  displacements  from  the 
equiHbrium  condition.  Thus  a  change  in  the  price  of  the  &th 
good  leads  to  the  equations 

„..    ^        '      dy        ^     ds  dX 

3  Opk  Vpk  Opk  Vpk 

where  dik  is  the  Kronecker  delta  and  i  =  1,  .  . .,  n. 
In  addition  we  have 

^  ^    $x3       ^     dy       ^     ds  dX 

i         vpk  vpk  opk  opk 

^  ^    dx,       ^     dy        ^     ds        dX 
(6)  1  Osj-^  +  0syJL  +  0Jr=o> 

3  °Pk  °Pk  Vpk  Opk 

dx,  dy        ds 

3       opk  opk      dpk 

Then  proceeding  in  the  orthodox  manner  we  make  use  of  the 
determinant 
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and  Cramer's  rule  to  obtain 


■w     1 


(8) 
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In  (8),  (9)  and  (10)  the  subscripts  to  D  indicate  cofactors.  The 
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last  three  equations  are  clearly  comparable  to  the  Slutsky-Hicks 
equation.4  They  show  the  effect  of  a  price  change  on  the 
quantities  of  the  goods  purchased  by  the  individual,  the  hours 
of  work  the  individual  wishes  to  offer  and  the  savings  that  he 
wants  to  lay  by.  In  each  case  the  effect  of  the  price  change  is 
divided  into  two  parts,  again  obviously  comparable  to  the 
substitution  and  income  effects  of  the  Slutsky-Hicks  equation. 
To  continue  the  parallel  with  the  Slutsky-Hicks  equation,  we 
examine  the  income  effects  directly.  Under  the  assumptions 
that  we  are  using,  there  is  no  single  income  effect,  but  rather 
income  effects  arising  from  either  a  change  in  the  rate  of  return 
or  a  change  in  the  stock  of  earning  assets.  Thus  if  we  differentiate 
equations  (4)  and  equation  (2)  with  respect  to  r,  the  rate  of 
return,  we  get 

?«.2'+vi+^*5-v#-> ") 

_-  ^     dx,  dy  ds  dX 

j         or  dr  dr  dr 

—  „    dx*  dy  ds      dX 

\0. I  4-  0     —  4-  0 1 =0 

f  9«dr  +  svdr  +     88dr      dr         ' 

_,      dx,  dy      ds 

i        dr  dr       dr 
Solving  (11)  yields 

(12)  ^  =  M%^>  (»•=!.  •■...»). 

(13)  g  =  M^^±i, 

Then  substituting  from  (12)  to  (8),  from  (13)  to  (9)  and  from 

(14)  to  (10)  gives  us 

(15)  - — ='—./— — -r-i  (t—l,...,n) 

v     ;  dpk  D        M  dr  v  ; 

4)  See,  for  example,  J.  R.  Hicks,  Value  and  Capital  (2nd  ed.,  Oxford,  1946) 
pp.  306—9. 


dy  __ 
dp,  ~" 
ds 

Q  Dk,  n+1 

dp* 
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M  dr 

HOURS  OF  WORK,  SAVINGS,  AND  THE  UTILITY  FUNCTION         119 

(16) 

(17) 

In  equations  (15),  (16)  and  (17)  the  effects  of  a  price  change 
on  the  quantity  of  a  good,  the  hours  of  work  and  savings  are 
divided  into  a  substitution  effect  and  an  income  effect,  the 
income  effect  resulting  from  a  change  in  the  rate  of  return.  We 
can  obtain  equations  similar  to  (15),  (16)  and  (17)  by  considering 
the  effects  of  a  change  in  M,  the  stock  of  earning  assets.  To  do 
this  we  differentiate  equations  (4)  and  equation  (2)  with  respect 
to  M  obtaining 

_,        dx,  dy        ,     ds  dX 

2,     dx,  dy  ds  dX 

"  dM  +    n  dM  +     va  dM  dM        ' 


(18) 


^  ^    to*  r     dy  ds        dX 

7    sj  dM  ^  sy  dM  ^     8S  dM  ^  dM        ' 

dxs  dy        ds 

7  Fa  dM  dM  T  dM 


Solving  equations  (18)  yields 

(19)  |g  =  r£sfei,  , v=..  !., 

(20) 


dM 

'      D     ' 

dy 

M  Dn+3,n+l 

dM 

'      D 

ds 

^n+3,  tc+2 

dM 

'       D 

(21) 

Then  substituting  from  (19)  to  (8),  from  (20)  to  (9)  and  from 

(21)  to  (10)  shows 

(22)  dp*~~    ~D~~7dMt 


dy 

1    D    " 

*k   dy 
r   dMy 

ds 

-  ^k,  n+2 

1     D     ~ 

xk    ds 

~~7dM' 
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(23) 

(24) 

Inspection  of  equations  (22),  (23)  and  (24)  again  shows  the 
effect  of  a  price  change  on  the  quantity  of  a  good,  hours  of 
work,  and  saving  divided  into  a  substitution  effect  and  an  in- 
come effect,  the  income  effect  now  being  related  to  a  change  in 
the  stock  of  earning  assets.  Again  the  similarity  to  the  Slut  sky- 
Hicks  equation  is  obvious. 

The  Effects  of  a  Wage  Rate  Change 

A  displacement  from  equilibrium  due  to  a  change  in  the 
wage  rate  also  shows  a  compound  effect  similar  to  that  of  the 
Slutsky-Hicks  equation.  To  approach  this  we  find  the  partial 
derivatives  of  equations  (4)  and  equation  (2)  with  respect  to 
the  wage  rate,  obtaining 

4        dw  dw  dw  dw 
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Solving  equations  (25)  shows 

us  that 

(26) 

d%i  _   iDn+l,i 

dw             D 

1  *     D     ' 

(27) 

dy           ^n+1,, 

dw              D 

+  y     D     . 

(28) 

1      +1,n+2   1   v     n+s>n+2 
dw"          D         '   V      D 

(i  =  1,  .  .  .,  n) 
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The  similarity  of  equations  (26),  (27)  and  (28)  to  the  Slutsky- 
Hicks  equation  is  obvious  and,  as  in  the  case  of  a  response  to  a 
price  change,  we  have  a  choice  of  two  income  effects.  Thus  we 
can  substitute  from  (12)  to  (26),  from  (13)  to  (27)  and  from 
(14)  to  (28)  obtaining 

m  p  =  K£^±+»p,        (,-_lf.  .'.,„,, 

dw  D  M  or 

(30)  ^-X~~W~  +M~dr-' 

rftlSk  &  ,#„+!,  n+2     ,       V      ds 

(31)  dw-X~~W-^MTr' 

Alternatively  we  can  substitute  from  (19)  to  (26),  from  (20) 
to  (27)  and  from   (21)  to  (28);  this  yields 

32  —  =  1—-l-  +  --  —  {i=l,  ...,  n), 

dw  D  r  dm 

(33)  ^"^""d-" +  yaM' 

(34)  a^-^^D^  +  yaM* 

In  these  latter  equations  the  income  effect  shows  the  result  of 
a  change  in  the  stock  of  earning  assets. 


Restrictions  on  Consumer  Behavior 

For  many  years  critics  of  utility  and  demand  theory  have 
argued  that  this  theory  was  too  restrictive  to  constitute  a 
reasonable  description  of  human  behavior.  Basically  the  ar- 
gument contends  that,  in  the  theory,  the  path  of  a  consumer's 
behavior  is  preordained  and  that  he  responds  too  mechanically 
to  changes  in  prices  and  income;  that  too  few  allowances  for 
individual  deviations  are  made;  in  short  that  he  is  Veblen's 
"oscillating  globule  of  desire/'  In  considering  the  validity  of 
this  criticism  it  is  instructive  to  consider  the  changes  in  quantities 
of  goods,  hours  of  work  and  savings  as  developed  in  this  paper. 
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It  is  quite  literally  true  that  in  no  case  developed  in  this 
paper  can  we  be  certain  of  the  direction  of  change  of  quantities 
of  goods,  hours  of  work  or  savings  in  response  to  a  change  in 
prices,  wage  rate,  rate  of  return  or  stock  of  earning  assets. 
That  is,  on  the  basis  of  logic  or  theory  alone,  we  cannot  be 
certain  that  a  change  in  any  dependent  variable  with  respect 
to  any  independent  variable  is  invariably  positive  or  negative. 
For  such  conclusions  we  must  resort  to  empirical  observation. 
Surely  this  is  not  an  overly  restrictive  representation  of  consumer 
behavior. 

Indeed  in  only  two  of  the  cases  considered  does  the  mathe- 
matics show  any  indication  of  the  direction  of  change.  One  of 
these  is  the  well  known  instance: 

dpi  D  '     r    dM' 

i.e.  the  case  of  the  response  of  the  quantity  of  a  good  to  a  change 
in  its  own  price.  In  this  equation  the  first  term  of  the  righthand 
member  must  be  negative.  This  is  true  because  the  principle 
minors  of  the  determinant  in  (7)  oscillate  in  sign,  hence  the  ratio 
of  cof actor  to  determinant  must  be  negative,  while  —X  must  be 
positive.  But  this  is  not  sufficient  to  insure  a  negative  response 
of  the  good  to  a  change  in  its  own  price  because  the  second  term 
of  the  righthand  member  could  be  positive  and  larger  in  absolute 
amount  than  the  first  term. 

The  second  instance  in  which  the  mathematics  gives  some 
indication  of  the  direction  of  change  is  the  case  of  the  response 
of  hours  of  work  to  change  in  the  wage  rate.  The  equation  in 
this  case  is  of  the  type 

dy  =xDn+i,n+i  .    V    ty 
dw  D  r  dM' 

In  this  equation  the  first  term  of  the  righthand  member  is 
obviously  positive,  but  again  this  is  not  sufficient  to  assure 
that  hours  of  work  increase  in  response  to  a  change  in  wage 
rate  because  the  income  term  can  be  sufficiently  negative  to 
offset  the  other  term.  Indeed,  since  the  income  term  represents 
a  change  in  unearned  income,  it  seems  quite  probable  on  empirical 
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grounds  that  the  income  term  will  be  negative  and  quite  large. 
Consequently  we  cannot  be  certain  of  the  direction  of  change  in 
this  case  either,  or,  indeed,  in  any  case  without  reference  to 
empirical  observation. 


Ill 


Some  Characteristics  of  the  Demand  Equation 

The  equihbrium  conditions,  equations  (4)  and  (2),  represent 
n  +  3  conditions  on  the  2^  +  6  variables,  xv  .  .  .,  xn\  pv  .  .  .,  pn; 
y,  s,  w,  r,  M  and  L  Then  assuming  that  there  are  no  mathe- 
matical difficulties,  it  is  possible  to  solve  for  n  +  3  of  the 
variables  in  terms  of  the  remainder.5)  It  is  of  special  interest 
to  view  these  solutions  as  demand  equations  written  as 

(35)  xt  =  ^  (ply  .  .  .,  j>n,  M,  r,  w),     {i  =l,...,n) 

(36)  y  =  K(plt...,pn,  M,  r,  w), 

(37)  s  =  hs{pv  .  .  .,  pn>  M,  r,  w), 

(38)  -k  =  hx{p1,...,pn,  M,  r,  w). 

In  this  form  the  equations  show  as  dependent  variables  the 
quantities  that  are  actually  subject  to  control  by  the  individual. 
The  demand  equations  that  we  have  formulated  are  free  of 
any  particular  index,  i.e.,  it  does  not  make  any  difference  what 
utility  index  we  use.  Thus  the  situation  with  respect  to  the 
choice  of  utility  indicator  is  the  same  as  in  the  case  in  which 
savings  and  hours  of  work  are  not  included  in  the  utility  function. 
The  freedom  from  the  utility  index  may  be  seen  by  rewriting 
equations   (4)  as 

0.  —0 

(39)  -X  =  -1  = v  =  0S,  (i  =  1,  .  .  .,  n). 

Pi  w 

This  shows,  among  other  things,  that  the  marginal  utility  of 
money  is  equal  to  the  marginal  utility  of  savings  in  the  equili- 
brium position.  From  equations  (39)  we  obtain 

5)  See  P.  A.  Samuelson,  Foundations  of  Economic  Analysis  (Cambridge, 
Mass.,   1947),  pp.  99—100. 
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*t  _  pi 

—0V       w 

0,       fi      F'0S' 

-*,  -F'0„ 

=  w  = 


®s  F'0S 

where  F  is  a  monotonic  transformation.  Equations  (40)  show 
that  if  a  monotonic  transformation  of  the  utility  function  were 
made,  it  would  have  no  effect  on  the  equilibrium  position.6) 

The  Absence  of  a  Homogeneity  Condition 

In  the  orthodox  development  of  the  theory  of  individual 
demand,  the  demand  equations  are  homogeneous  of  order  zero 
with  respect  to  prices  and  income.7)  This  arises  from  the  absence 
of  savings  in  the  budget  equation  and  from  equilibrium  conditions 
similar  to  the  first  equations  of  (40).  In  the  formulation  developed 
in  this  paper,  the  individual  demand  equations  are  no  longer 
homogeneous  of  order  zero  with  respect  to  the  independent 
variables.  This  may  be  seen  from  inspection  of  equation  (2). 
Since  savings  have  no  price  in  the  ordinary  sense,  a  proportional 
change  in  the  prices  associated  with  other  variables  does  not 
affect  the  term  representing  savings.  Consequently  the  equality 
in  (2)  is  not  maintained.  Similarly,  from  inspection  of  the  last 
two  equations  in  (40),  it  is  clear  that  a  proportional  change 
in  prices  and  wage  rate  will  alter  the  equilibrium  position. 
Consequently,  the  restriction  that  the  demand  equations  be 
homogeneous  of  order  zero  is  lost  in  the  present  formulation. 
It  is  interesting  to  notice  that  this  restriction  is  preserved  if 
hours  of  work  are  put  into  the  utility  equation  and  savings 
are  not.8) 

6)  Ibid.,  pp.  97—99. 

7)  Ibid.,  pp.   104—7. 

8)  See  F.  Gilbert  and  R.  W.  Pfouts,  "A  Theory  of  the  Responsiveness  of 
Hours  of  Work  to  Changes  in  the  Wage  Rate,"  Review  of  Economics  and 
Statistics,  XL   (1958),  pp.   116—21. 
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The  point  of  the  preceding  paragraph  can  be  developed  more 
finely  by  considering  the  equation 

k       dpk  dM         dr  ow 

(41)  =„lpj^-lhXk^M  +  rM^i 

k  U  k  JJ  u 

the  substitutions  being  made  from  equations  (8),  (19),  (12) 
and  (26).  If  the  homogeneity  condition  held  in  the  present 
formulation,  the  left  member  of  (41)  would  be  identically  zero 
by  Euler's  theorem  on  homogeneous  functions;  we  shall  see 
that  this  is  not  true  in  general.  If  we  add  and  substract 
X{Dn+2fijD)  to  the  right  member  of  (41)  and  rearrange  terms  we 
obtain 

The  first  three  terms  in  expression  (42)  can  be  eliminated  by 
reference  to  (7)  and  the  theorem  that  expansion  of  a  deter- 
minant by  the  elements  of  one  column  and  the  corresponding 
cof actors  of  another  column  yields  zero.  By  reference  to  (2)  it 
follows  that  the  second  three  terms  of  (42)  reduce  to  s(Dn+3i/D). 
Consequently  we  can  write 

v^    dx*    i    i\/rdxi    i      dxi    i       dx* 
*       opk  dM  or  aw 

(43)  Fk 

D  D  D 

=  gjjffi  +  rM     ***-*  +  X     n+2'{. 
D  D  D 

Substituting  from  (19)  and  (24)  gives 
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,     >       _      dx;         dXi  dx,       s    dxt       x,    ds        ds 

(44)  y  i>1f—Ljrr—1 ■  +  w— -  = 

*       dp*         &  to       r  dM       r  BM      dp, 

If  (44)  is  transformed  into  elasticity  form,  we  obtain 

(45)  IVih+Vir  +  V**  =  s\-irj('iliM-VsM)—^--Vsi)' 
k  \rM  p^     I 

While  (44)  and  (45)  lack  the  straight-forward  simplicity  of  the 
homogeneity  condition,  they  do  bring  out  certain  points.  First 
it  may  be  noted  by  reference  to  these  equations  that  the  homo- 
geneity condition  clearly  does  not  hold  under  the  present  as- 
sumptions; further  it  is  difficult  to  visualize  an  assumption  with 
intuitive  appeal  that  would  restore  the  homogeneity  condition. 
Secondly,  the  influence  of  savings  in  upsetting  the  homo- 
geneity condition  is  clearly  exhibited  in  the  righthand  members  of 
these  equations.  Observation  of  the  righthand  member  of  (45) 
shows  that  if  savings  were  zero,  the  homogeneity  condition  would 
almost  be  restored.  But  the  full  homogeneity  condition  would 
not  be  restored  because  the  elasticity  of  the  ith  good  with  respect 
to  the  stock  of  earning  assets  would  still  be  missing  from  the 
lefthand  member  of  (45). 

A  similar  development  can  be  made  for  equation  (36),  the 
equation  expressing  the  demand  of  the  individual  for  hours  of 
work.   In  elasticity  from  the  result  is 


(46)        2  Vvk  +  Vyr  +  Vi 
k 


1  1  1 

^  {VvM  ~  VsM)   +  —  Vswj  ' 


jyM  yw 

Reference  to  (46)  shows  that  the  homogeneity  condition  does 
not  hold  in  general  for  (36).  Again  the  importance  of  savings 
in  removing  the  homogeneity  condition  with  reference  to  the 
demand  for  hours  of  work  is  brought  out  in  the  righthand 
member  of  (46). 

In  the  same  fashion  equation  (37)  can  be  used  to  show  the 
absence  of  the  homogeneity  condition  in  the  demand  for  savings. 
When  exhibited  in  elasticity  form  the  result  is 

(47)  Z  Vsk  +  Vsr  +  Vsw M^sM^ ^ • 

&  rlvl  s         D 

Again  the  homogeneity  condition  is  clearly  not  satisfied.  With 
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regard  to  this  righthand  member  of  (47)  it  may  be  observed 
that  the  ratio  of  cofactor  to  determinant  will  be  negative  since 
principal  minors  of  the  determinant  in  (7)  will  alternate  in 
sign;  at  the  same  time  A  will  be  negative.  Consequently  the  sign 
of  each  member  is  the  same  as  the  sign  of  s.  If  savings  are 
positive  each  member  of  (47)  is  positive,  but  if  dissaving  takes 
place  each  member  is  negative. 

Effects  of  Saving  and  Dissaving 

By  making  certain  assumptions  that  appear  to  be  reasonable 
on  empirical  grounds,  we  can  obtain  some  additional  results 
from  (47).  We  assume  that  the  elasticity  of  savings  with  respect 
to  price  is  negative  but  that  the  elasticities  of  savings  with 
respect  to  wage  rate,  rate  of  return  and  stock  of  earning  assets 
are  positive.  It  follows  from  (47)  that  if 

5  >  0, 
then 

(48)  ?]sr  -\-rjsw —  f]sM  >  —  2  yen- 

rM  & 

Under  the  assumptions  about  the  signs  of  the  elasticities,  the 
first  two  terms  in  the  lefthand  member  of  (48)  are  both  positive 
and  the  third  term  has  a  negative  effect,  while  the  right  member 
is  positive,  consisting  of  a  negative  term  prefixed  by  a  minus 
sign.  Then  (48)  implies  that  savings  respond  more  heavily  to 
changes  in  rate  of  return  and  in  wage  rate  than  they  do  to  prices 
and  to  stock  of  earning  assets  weighted  by  savings  as  a  propor- 
tion of  income  from  earning  assets.  In  other  words,  rate  of  return 
and  wage  rate  dominate  the  weighted  value  of  stock  of  earning 
assets  and  prices  in  affecting  savings. 
It  also  follows  from  (47),  that  if 

s  <  0, 
then 

s 

(49)  rjsw  Jrr]sr —  rjsM  <  —  2  Vsk- 

rM  ft 

Continuing  the  assumptions  concerning  the  signs  of  the  elasticities 
of  savings,  the  first  two  terms  of  (49)  are  positive,  the  third  is 
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positive  but  prefixed  by  a  minus  sign  and  the  last  term  is  negative 
and  prefixed  by  a  minus  sign.  Under  these  conditions  it  follows 
that  dissavings  respond  more  strongly  to  price  changes  than  to 
changes  in  wage  rate,  rate  of  return  and  stock  of  earning  assets 
with  the  appropriate  weight.  Consequently  when  dissaving  takes 
place,  price  influences  dominate  all  other  influences.  Surely 
this  has  a  strong  intuitive  appeal  since  one  would  expect  dis- 
saving to  take  place  at  an  income  level  so  low  that  prices  of 
consumers '  goods  dominated  all  consumers'  actions. 

IV 

New  and  Modified  Slutsky  Conditions 

The  Slutsky  conditions,  which  constitute  restrictions  on  the 
demand  equations,  are  modified  from  their  usual  form  when  hours 
of  work  and  savings  are  included  in  the  utility  function.  This  is 
easily  seen  by  reference  to  equation  (15),  keeping  in  mind  the 
symmetry  of  D.  From  (15)  we  may  write 


dxk          dxk 

And  from  (22)  we  may  write 

/KIN                                          dX*       .              dX* 

(51)              rW^Xhm- 

dxk          dxk 

In  addition  to  the  modified  Slutsky  conditions  of  (50)  and 
(51),  we  can  find  another  restriction  on  the  demand  equations 
similar  to  the  Slutsky  conditions.  From  comparing  equations 
(16)  and  equation  (29),  we  see  that  it  is  possible  to  write 

(52)  m- — y-^-=  -MTT-Xi^~' 

ow  or  dpi  or 

Equation  (52)  is  a  Slutsky-type  restriction  and  presumably 
under  ideal  conditions  would  be  capable  of  being  tested  for 
refutation.  Alternatively,  from  equations  (23)  and  (32)  it  is 
easily  seen  that  we  can  say 

(53)  rdw    ym~    rdft    X,W 


HOURS  OF  WORK,  SAVINGS,  AND  THE  UTILITY  FUNCTION         129 

Some  Restrictions  on  Elasticities 

In  addition  to  the  Slutsky-type  of  restriction,  we  can  bring 
forward  other  restrictions  concerning  the  elasticity  of  the 
dependent  variables  with  respect  to  rate  of  return  and  the  stock 
of  earning  assets.  Put  briefly,  this  class  of  restrictions  says  that 
the  elasticity  of  any  dependent  variable,  i.e.  the  x's,  y  or  s,  with 
respect  to  the  rate  of  return  equals  the  elasticity  of  the  same 
variable  with  respect  to  the  stock  of  earning  assets. 

Thus  from  equations  (12)  and  (19)  we  notice  that 

1    dxt        1   dx{ 
~7dM~M~dr' 
consequently 

M  dx{        r  dxt 

(54)  ^idM^^iTr=nm  =  ni"     (*=1' ••-*)• 

From  a  comparison  of  (13)  and  (20)  it  is  clear  that  we  can  also 
state  that 

(55)  Tjyr  =  VvM- 

It  is  equally  obvious  from  (14)  and  (21)  that 

(56)  r)sr  =  r}sM. 

The  restrictions  on  the  demand  equations  that  are  represented 
by  equations  (54),  (55)  and  (56)  are  restrictions  on  individual 
demand  equations  at  the  same  level  as  the  Slutsky  conditions. 
I.e.  they  arise  from  the  fundamental  assumptions  regarding  the 
utility  equation  and  the  postulate  of  maximization.  In  an  em- 
pirical sense  they  are  both  less  complex  and,  probably,  less 
important  than  the  Slutsky  conditions;  empirically  they  appear 
to  be  less  restrictive  on  the  variables  than  the  Slutsky  conditions. 
In  substance  they  merely  state  that  the  effect  of  a  change  in 
income  from  earning  assets  on  the  quantities  of  goods,  hours  of 
work  or  savings  is  the  same  whether  the  increment  results  from 
a  change  in  rate  of  return  or  a  change  in  stock  of  earning  assets. 

The  Micro-Propensity  to  Save 

By  employing  hours  of  work  and  savings  as  variables  in  the 
utility  function,  we   gain   a   more   realistic  prototype  of  the 
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consumer  and  additional  restrictions  on  the  individual  demand 
equations.  In  addition  the  marginal  propensity  to  save  takes 
on  meaning  in  the  case  of  an  individual.  That  is,  we  can  now 
write 

57  ds  =  —  dw  +  —  dr  +  ^-zdM  +  2  — dp, 

dw  dr  dM  *  dpi 

as  a  meaningful  statement  in  which  substitutions  can  be  made 
and  which  is  related  to  other  aspects  of  the  consumer's  behavior. 
An  individual  propensity  to  save  relation  is  not  possible  in  the 
orthodox  formulation  of  utility. 

Indeed  the  micro-approach  yields  a  more  comprehensive 
statement  of  the  motives  behind  the  marginal  propensity  to 
save  because  it  includes  the  effects  of  relative  prices  in  the  last 
term  of  the  right  member  of  (57).  That  is,  the  macro-approach 
usually  shows  the  propensity  to  save  as  a  function  of  income, 
but  not  of  relative  prices.  The  response  of  savings  to  changes  in 
income,  but  not  prices,  at  the  micro  level  is  given  by 

ds  ds  ds 

(58)  ds  =  — -dw  +—dr  +  —.  dM. 

v     '  dw  dr  dM 

But  equation  (58)  is  strictly  valid  only  if  there  have  been  no 
changes  in  the  prices  of  the  goods  consumed,  or  if  the  last 
term  of  (57),  through  a  coincidence,  is  equal  to  zero.  The  absence 
of  general  validity  for  equation  (58),  coupled  with  the  fact  that 
the  micro  propensity  to  save  is  a  combination  of  many  micro 
propensities  to  save,  suggests  that  a  part  of  the  difficulties  in 
empirical  studies  of  the  macro  propensity  to  save  or  to  consume 
may  be  due  to  their  ignoring  relative  prices  as  independent 
variables. 


PART  III 

On  Economic  Dynamics 


MILTON  FRIEDMAN 


In  Defence  of  Destabilizing  Speculation1] 


Two  propositions  about  private  speculation  are  widely  held: 
first,  that  speculation  is  in  fact  often  destabilizing,  in  the  sense 
that  it  makes  fluctuations  in  prices  wider  than  they  would 
"otherwise"  be;  second,  that  destabilizing  speculation  neces- 
sarily involves  economic  loss.  This  pair  of  propositions  underlies 
much  current  opinion  about  commodity  policy  —  where  they 
lead  to  support  for  "buffer  stocks"  and  similar  plans  —  and 
about  balance  of  payments  policy  —  where  they  constitute  a 
chief  criticism  of  floating  exchange  rates. 

This  note  is  not  intended  to  be  an  exhaustive  analysis  of  this 
pair  of  propositions,  or  of  speculation  in  general.  Its  purpose  is 
much  more  limited:  to  point  out  that  the  second  proposition  is 
invalid,  that  destabilizing  speculation,  though  it  may  in  some 
cases  lead  to  economic  loss,  may  in  others  confer  economic 
benefit.  The  empirical  generalization  about  the  prevalence  of 
destabilizing  speculation  —  which  is  what  gives  the  theoretical 
proposition  its  interest  —  seems  to  be  one  of  those  propositions 
that  has  gained  currency  the  way  a  rumor  does  —  each  man 
believes  it  because  the  next  man  does  —  and  despite  the  absence 
of  any  substantial  body  of  well  documented  evidence  for  it. 
It  is  a  proposition  that  badly  needs  intensive  empirical  in- 
vestigation. My  own  conjecture  is  that  such  an  investigation 
would  show  it  to  be  unfounded.  But  this  is  simply  a  conjecture 
and  plays  no  part  in  what  follows. 

The  ready  acceptance  of  the  proposition  that  destabilizing 
speculation  is  economically  harmful  reflects,  I  believe,  a  natural 
bias  of  the  academic  student  against  gambling  and  in  favor  of 
insurance.  It  is  natural  for  him  to  regard  a  futures  market,  for 
example,  as  a  market  in  which  a  "legitimate"  producer  hedges 

x)  I  am  indebted  for  comments  on  an  earlier  draft  to  Martin  Bailey,  Harry 
Johnson,  James  Meade,  Joan  Robinson,  and  Dennis  Robertson. 
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his  risks  by  transferring  them  to  a  "speculator";  the  producer 
is  viewed  as  buying  "insurance"  from  the  speculator.  But 
granted  that  this  is  a  possible  and  indeed  likely  interpretation 
of  an  actual  futures  market,  it  is  not  the  only  possible  one.  May 
such  a  market  not  be  one  in  which  the  "legitimate"  producer 
engages  as  a  side-line  in  selling  "gambles"  to  speculators  willing 
to  pay  a  price  for  gambling  and  knowingly  doing  so?  And  if  so, 
moral  scruples  about  gambling  aside,  is  any  economic  loss  in- 
volved? 

In  arguing  that  destabilizing  speculation  need  not  involve 
economic  loss  I  do  not  mean  in  any  way  to  deny  the  usual  view 
that  stabilizing  speculation  confers  benefit.  In  this  usual  view, 
the  economic  function  of  speculation  is  taken  to  be  the  reduction 
of  inter-temporal  differences  in  price.  In  a  commodity  market, 
for  example,  a  speculator  is  viewed  as  performing  this  function 
by  buying  when  the  crop  is  plentiful  and  prices  "abnormally" 
low,  holding  stocks  of  the  commodity  until  prices  have  risen, 
and  then  selling  when  the  crop  is  short  and  prices  "abnormally" 
high.  In  this  way,  speculators  transfer  resources  from  less  to 
more  urgent  uses.  The  difference  between  the  prices  at  which  they 
sell  and  buy  is  their  margin,  which  must  cover  costs  of  storage 
and  furnish  their  remuneration.  The  excess  over  storage  costs 
is  a  payment  for  specialized  skill  in  knowing  when  to  buy  and 
when  to  sell  and  perhaps  also  for  bearing  risk. 

This  model  takes  for  granted  that  there  is  a  meaningful 
distinction  between  speculative  and  other  transactions,  that  one 
can  speak  of  what  the  price  would  have  been  in  the  absence  of 
speculation.  This  is  a  point  that  raises  many  difficulties  and 
requires  careful  examination  in  any  full  analysis  of  speculation.2) 
We  can,  however,  evade  it  for  our  purposes  by  narrowing  the 
question  under  discussion.  Consider  any  market  in  operation. 
Suppose  that  an  additional  set  of  transactions  are  made  in  that 
market  by  an  additional  group  of  people  whom  we  shall  call 
"speculators"  or  "new  speculators".  We  shall  then  deal  only 
with  the  question  whether  this  additional  set  of  transactions 

2)  See,  for  example,  the  comments  by  W.  J.  Baumol,  "Speculation,  Profi- 
tability, and  Stability,"  Review  of  Economics  and  Statistics,  August,  1957, 
pp.  263—71. 
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increases  the  fluctuations  in  price  and,  if  it  increases  them, 
whether  it  involves  an  economic  loss  or  confers  a  gain.  By  dealing 
in  this  way  with  a  change  in  the  amount  of  speculation,  we  can 
avoid  the  troublesome  intellectual  problem  of  defining  zero 
speculation  without  any  essential  loss  in  generality.  We  shall 
make  one  further  assumption  to  evade  a  troublesome  problem: 
namely,  that  the  activities  of  speculators  do  not  affect  the 
quantities  demanded  and  supplied  by  other  participants  in  the 
market  at  each  current  price.  This  implies  that  there  is  a  well- 
defined  price  that  will  clear  the  market  at  each  point  in  the 
absence  of  speculation  and  that  this  price  is  not  affected  by 
speculation. 

With  these  assumptions,  it  is  clear  that,  if  carrying  costs  are 
neglected,  our  model  implies  that  speculators  gain  if  they  reduce 
inter-temporal  differences  in  price,  and  lose  if  they  widen  such 
differences.  Speculators  can  fill  in  the  troughs  of  price  move- 
ments only  by  buying  net  when  prices  would  otherwise  be  low; 
they  can  flatten  out  the  peaks  only  by  selling  net  when  prices 
would  otherwise  be  high;  unless  they  carry  this  so  far  as  to 
reverse  peaks  and  troughs,  they  gain  by  the  difference.  Con- 
versely, speculators  can  make  fluctuations  wider  (in  the  same 
direction)  only  by  selling  net  when  prices  would  otherwise  be 
low  and  buying  net  when  prices  would  otherwise  be  high.  But 
this  means  that  they  sell  at  a  lower  price  than  they  buy  and  so 
make  losses.  Our  model  therefore  implicitly  defines  stabilizing 
speculation  as  speculation  yielding  gains  (carrying  costs  aside) 
and  destabilizing  speculation  as  speculation  yielding  losses. 
The  circumstances,  if  any,  under  which  this  will  not  be  true 
deserve  extensive  examination  in  a  full  analysis  of  speculation 
but  can  be  accepted  for  our  more  limited  purposes,  which  is 
simply  to  show  that  destabilizing  speculation  need  not  involve 
economic  loss,  not  that  it  cannot  do  so.3) 

8)  See  Ibid,  for  one  such  fuller  examination.  It  will  be  clear  that  our  as- 
sumptions rule  out  the  main  case  there  considered. 

Baumol  also  considers  a  special  case  corresponding  to  our  assumptions 
(pp.  269 — 70).  His  own  conclusion  is  ambiguous  but  only  because  in  judging 
the  profitability  of  the  speculation  he  does  not  require  it  to  be  carried  through 
to  completion,  in  the  sense  that  the  speculators  end  up  in  their  initial  position 
with  respect  to  the  holdings  of  the  speculative  commodity. 
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One  reason  why  actual  speculation  might  not  conform  to  the 
model  described  in  the  preceding  three  paragraphs  is  avoidable 
ignorance.  By  no  means  all  actions  that  are  mistakes  when 
viewed  ex  post  fall  into  this  category.  If  I  wager  even  money 
that  a  coin  will  come  up  tails  and  it  comes  up  heads,  I  clearly 
have  made  a  mistake  ex  post,  in  the  sense  that  I  shall  wish  that 
I  had  chosen  heads.  If,  in  addition,  I  discover  by  an  examination 
of  the  coin  that  it  has  heads  on  both  sides  or  in  some  other  way 
is  biassed  toward  heads,  and  if  I  could  have  made  this  examina- 
tion before  the  wager,  then  I  have  also  made  a  mistake  ex  ante. 
On  the  other  hand,  if  such  additional  examination  gives  me  no 
more  reason  than  I  had  before  to  question  my  belief  that  the 
coin  is  fair,  then  my  initial  choice  of  tails  may  be  bad  luck  but 
cannot  be  described  as  a  mistake.  The  distinction  between  the 
two  cases  is,  in  principle,  whether  I  would  have  acted  differently 
in  advance  of  the  actual  toss  if  I  had  had  the  knowledge  I 
gained  after  the  toss  except  for  the  actual  outcome  itself,  i.e., 
if  I  had  had  the  knowledge  that  it  would  have  been  possible  for 
me  to  have  had  before  the  toss.  In  the  same  way,  the  mere  fact 
that  speculators  make  losses  over  a  particular  period  and  in  fact 
destabilize  prices  for  that  period  is  no  evidence  either  that  the 
losses  could  have  been  avoided  given  the  general  state  of  knowl- 
edge when  the  speculation  was  entered  into  or  that  speculation 
is  on  balance  destabilizing  in  any  more  fundamental  sense. 

If  destabilizing  speculation  does  arise  from  avoidable  ignorance, 
it  must  be  granted  immediately  that  there  is  an  economic  loss. 
The  loss  is  borne  primarily  by  the  speculators,  though  if  the 
operation  is  sufficiently  large,  second  order  effects  on  others 
may  not  be  negligible  in  the  aggregate.  It  may  be  noted  in  passing 
that  insofar  as  this  case  justifies  any  action  by  government,  it 
justifies  solely  the  distribution  of  knowledge.  Suppose  private 
speculation  is  destabilizing  because  ignorant  speculators  behave 
against  their  own  interests,  but  speculation  by  government 
officials  trying  to  achieve  the  same  end  as  private  speculators 
would  be  stabilizing  because  of  greater  knowledge.  The  appro- 
priate solution  is  then  for  the  government  officials  to  make  their 
knowledge  available  either  by  providing  the  information  on 
which  their  price  forecasts  rest  or  by  making  and  publishing 
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the  price  forecasts  themselves.  If  these  are  more  accurate,  on 
the  whole,  than  the  forecasts  private  speculators  would  otherwise 
use,  private  speculators  have  a  strong  incentive  to  act  in  ac- 
cordance with  them  and  in  the  process  will  produce  the  same 
results  as  government  speculation  in  accordance  with  the  same 
forecasts.  If  the  forecasts  are  not  more  accurate,  they  will  tend 
to  be  disregarded  and  no  great  harm  will  be  done.4) 

To  see  how  destabilizing  speculation  can  arise  without  avoid- 
able ignorance,  let  us  start  with  a  commodity  market  which 
is  in  operation.  Suppose  that  there  exists  independent  gambling 
establishments  in  which  all  gambling  takes  the  form  of  betting 
on  the  future  price  of  the  commodity  in  question  —  say  rubber. 
The  people  who  bet  on  the  price  of  rubber  in  the  hypothetical 
gambling  establishment  do  not  buy  or  sell  rubber,  and  neither 
do  the  people  who  run  the  establishment.  Their  operations 
therefore  have  no  direct  effect  on  the  price  of  rubber;  the  rubber 
market  simply  takes  the  place  of  the  roulette  wheel  at  Monte 
Carlo.5)  We  may  suppose  the  proprietors  of  an  establishment  to 
operate  solely  as  brokers,  engaging  in  no  gambling  themselves 
but  being  paid  a  fee  for  providing  facilities  and  bringing  together 
people  willing  to  take  opposite  sides  of  a  common  wager.  And  we 
suppose  throughout  that  the  people  engaging  in  the  gambling 
do  so  deliberately  and  are  reasonably  well  informed:  they  like 
to  gamble  and  are  willing  to  pay  a  price  to  do  so.  Let  us  put  to 
one  side  any  moral  objections  to  gambling,  and  suppose  that 
the  gambling  services  are  provided  under  competitive  con- 
ditions. The  proprietors  of  the  gambling  house  are  then  devoting 
economic  resources  to  producing  services  to  satisfy  the  wants 
of  consumers,  who  are  willingly  buying  the  services  and  paying 
a  price  equal  to  the  cost  of  the  alternative  services  that  could  have 
been  obtained  with  the  same  resources.  Clearly  there  is  economic 

4)  One  case  in  which  publication  of  forecasts  or  the  equivalent  might  be 
especially  called  for  is  if  the  authorities  feel  it  necessary  to  suppress  some 
relevant  information  for  security  reasons.  They  might  be  able  to  offset  the 
effects  of  such  suppression  on  the  judgments  of  traders  by  issuing  price  forecasts. 

6)  There  could  be  an  indirect  effect  if,  for  example,  information  about  the 
odds  ruling  in  the  gambling  transactions  altered  the  expectations  about  future 
prices  of  the  people  trading  on  the  spot  market  and  so  changed  amounts  diverted 
to  stocks. 
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gain  rather  than  loss  through  the  operation  of  the  gambling 
house.6) 

Of  course,  there  may  in  fact  be  no  demand  for  this  service  at  a 
price  sufficient  to  call  it  forth.  Whether  there  is  depends  on  the 
preferences  of  the  public  for  gambling  of  various  types,  the  kind 
of  gambling  provided  by  the  rubber  market  —  that  is,  the 
probability  distribution  of  the  price  of  rubber  —  the  alternative 
sources  of  gambling  services,  their  cost  and  character,  and  so  on. 
The  willingness  of  people  to  buy  lottery  tickets  at  less  than  their 
actuarial  value  even  though  they  know  full  well  the  probabilities 
of  prizes  of  various  size  is  sufficient  evidence  that  people  are 
willing  to  pay  a  price  to  bear  at  least  certain  kinds  of  risks,  to 
be  subjected  to  increased  uncertainty.  In  any  event,  our  concern 
is  not  with  the  likelihood  that  gambling  establishments  of  the 
kind  described  would  be  profitable  but  only  with  the  consequences 
if  they  were. 

Consider  an  individual  who  wants  to  bet  that  the  price  of 
rubber  will  be  higher  a  month  from  now  than  it  is  now.  He  can 
place  such  a  bet  in  the  gambling  establishment  at  some  odds  and 
subject  to  paying  a  commission  to  the  proprietors.  An  alter- 
native way  in  which  he  can  subject  himself  to  the  same  un- 
certainty is  to  buy  rubber  in  the  market,  store  it  for  a  month, 
and  than  sell  it:  he  can  accumulate  positive  stocks.  The  cost 
in  this  case  is  the  cost  of  storage  over  the  month.  Similarly  an 
individual  who  wants  to  bet  that  the  price  will  fall  can  ac- 
complish the  same  objective  by  selling  rubber  now,  borrowing 
the  physical  commodity  in  order  to  make  delivery  currently: 
he  can  accumulate  negative  stocks.  He  may  be  paid  for  doing 
so,  because  he  saves  someone  storage  costs.  Presumably  however, 
the  amount  he  is  paid  will  be  less  than  the  storage  costs,  the 
difference  being  the  fee  for  lending  the  commodity.  And  if  the 
loan  requires  dipping  into  stocks  needed,  say,  to  facilitate 
production,  storage  costs  may  be,  as  it  were,  negative  and  he 

6)  It  will  be  noted  that  a  pure  futures  market  is  very  close  to  such  a  gambling 
establishment.  A  transaction  on  a  futures  market  does  not  by  itself  have  any 
effect  on  the  spot  market.  It  affects  current  price  only  to  the  extent  that  the 
price  established  leads  to  operations  on  the  spot  market  and  thereby  to  a  change 
in  the  size  of  stocks  carried  over.  This  is  analogous  to  the  indirect  effect  described 
in  the  preceding  footnote. 
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may  have  to  pay  to  borrow  the  goods.  (Remember  that  we  are 
considering  the  effect  of  the  actions  of  an  additional  group  of 
people.  Their  holding  negative  stocks  simply  means  that  total 
stocks  are  less  than  they  would  otherwise  be).  Suppose  in- 
dividuals find  this  alternative  way  of  gambling  cheaper.  The 
gambling  establishments  will  then  disappear  and  the  gambling 
services  be  provided  by  the  rubber  market. 

If  purchases  and  sales  just  offset,  there  is  no  effect  on  current 
price  and  the  net  costs  are  the  various  commissions  paid  to 
transact  the  business.  The  market  dealers  have  taken  the  business 
of  providing  gambling  services  away  from  the  gambling  in- 
stitutions proper.  But  purchases  and  sales  need  not  just  offset 
one  another  —  indeed,  the  lack  of  necessity  for  them  to  do  so 
may  be  one  of  the  advantages  of  operating  through  the  market, 
though  a  similar  possibility  could  be  provided  by  the  gambling 
establishments  if  their  proprietors  "made  book"  rather  than 
simply  acted  as  brokers.  If  purchases  and  sales  do  not  offset  one 
another,  the  price  of  the  commodity  is  affected. 

We  have  now  combined  the  two  activities:  gambling  on  the 
price  of  rubber,  and  the  rubber  market  proper.  Given  com- 
petitive conditions,  this  combination  will  occur  only  if  it  is  a 
cheaper  way  to  provide  gambling  services  and  so  in  this  respect 
represents  increased  efficiency  in  the  use  of  resources.  If  the  total 
expenditures  of  the  gamblers  on  gambling  services  exceeds  the 
commissions  involved,  this  is  equivalent  to  saying  that,  viewed 
as  a  body  of  speculators,  they  engage  in  destabilizing  speculation. 
But  their  losses  are  someone's  gain.  In  the  first  instance,  they 
will  be  the  gain  of  the  initial  participants  in  the  rubber  market. 
Operating  on  the  rubber  market  has  now  become  a  more  at- 
tractive business  since  one  can  now  engage  in  joint  production, 
producing  gambling  services  as  well  as  trading  services.  The 
result  will  be  to  attract  more  people  into  the  activity.  Temporary 
gains  will  be  competed  away  and  the  trading  margin  proper 
reduced,  so  raising  the  average  net  price  of  rubber  to  the  producer. 
But  this  in  turn  will  stimulate  output  and  so  reduce  the  average 
net  price  of  rubber  to  the  consumer.  The  provision  of  gambling 
services  is  now  being  rendered  jointly  by  the  producers  of  rubber, 
the  middle  men,  and  the  consumers  of  rubber.  In  return  for 
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wider  fluctuations  in  price  —  which  are  required  to  provide  the 
gamblers  or  speculators  with  the  uncertainty  they  want  to  bear 
—  the  producer  gets  a  higher  average  price  and  the  consumer 
pays  a  lower  average  price. 

Any  individual  producer  or  consumer  who  disliked  the  wider 
fluctuation  of  prices  could  insure  himself  against  it.  But,  given 
our  assumptions,  it  cannot  be  that  producers  and  consumers 
would  be  willing  to  pay  more  on  the  average  than  the  difference 
between  old  and  new  average  prices  to  insure  themselves  against 
the  wider  fluctuations.  For  this  would  contradict  the  initial 
assumption  that  there  was  a  demand  for  the  services  of  the 
gambling  establishments  at  a  positive  price.  The  people  who 
were  willing  to  make  bets  on  the  price  of  rubber  —  willing  to 
assume  risks  —  would  then  have  found  that  they  were  paid, 
instead  of  having  to  pay,  for  doing  so.  Instead  of  the  market 
being  supplemented  by  gambling  institutions,  it  would  have 
been  supplemented  by  insurance  companies,  insuring  people 
against  the  fluctuations  in  prices. 

I  grant  readily  that  this  picture  of  a  world  in  which  increasing 
fluctuations  in  the  prices  of  commodities  is  a  service  that  com- 
mands a  positive  price  is  hard  to  accept  as  a  valid  description 
of  the  actual  world;  not  so  much  because  people  are  not  willing 
to  pay  for  gambling  —  they  clearly  are  —  but  because  there 
seem  to  be  so  many  cheaper  ways  of  producing  the  gambles 
that  people  want  to  buy,  though  it  must  be  noted  that  some 
of  these  are  illegal  in  many  countries.  However,  this  is  the  picture 
that  is  implicit  in  the  acceptance  of  the  empirical  generalization 
that  destabilizing  speculation  often  occurs  in  practice,  except 
for  such  destabilizing  speculation  as  is  attributable  to  avoidable 
ignorance,  or  as  may  be  consistent  with  deviations  from  our 
initial  assumptions. 

Whether  particular  services  command  a  positive  or  negative 
price  —  are  consumption  services  or  productive  services  —  is  not 
determined  by  physical  or  technical  considerations  alone;  it 
depends  also  on  the  tastes  and  preferences  and  the  capacities 
and  opportunities,  of  the  community  at  large.  Painting  a  fence 
is  generally  regarded  as  a  productive  service  that  must  be  paid 
for,  as  an  activity  yielding  disutility,  and  so  the  price  of  painting 
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a  fence  is  generally  negative;  Tom  Sawyer  was  able  to  reverse 
this  attitude  and  to  make  it  an  activity  yielding  utility;  he  was 
able  to  charge  a  positive  price  for  the  privilege  of  painting  a 
fence.  This  is  the  essential  issue  involved  in  judging  speculation. 
Is  bearing  uncertainty  a  service  that  must  be  paid  for?  Or  a 
privilege  for  which  people  are  willing  to  pay?  Is  speculation  the 
rendering  of  a  productive  service  that  commands  a  reward? 
Or  is  it  a  means  of  gaining  utility  on  which  people  spend  part  of 
their  income?  If  it  turns  out  to  be  the  second  rather  than  the 
first,  is  this  any  reason  for  regarding  it  as  involving  economic 
loss?  Does  not  the  tendency  to  do  so  simply  reflect  the  preconcep- 
tions of  the  academic? 


HAROLD  T.  DAVIS 


The  Influence  of  Interest  Rates 
on  Time  Series  of  Price 


1.  Introduction. 


In  a  paper  that  has  become  classic  1)  Harold  Hotelling  discusses 
the  role  of  differential  equations  in  the  application  of  mathematics 
to  the  problems  of  the  economist.  He  points  out  both  the  analytical 
advantages  in  their  use  and  the  statistical  dangers  which  appear 
when  they  are  applied.  He  admires  the  superb  formulation  of 
the  astronomer  with  his  exact  equations  and  his  accurate  deter- 
mination of  empirical  constants.  This  he  contrasts  with  the  less 
satisfactory  situation  of  the  economist,  who  has  few  rigorous 
laws  to  guide  him  and  who  must  rely  upon  correlations  for  the 
computation  of  his  parameters.  But  Hotelling  points  out  that 
the  astronomer  has  had  the  great  advantage  of  a  "tyrannical 
sun"  which  dominates  the  motion  of  the  planets,  while  the 
economist  must  study  a  system  in  which  the  influence  of  any 
single  factor  is  submerged  in  a  complex  of  factors,  none  of  which 
is  dominant. 

A  problem  in  point  is  that  of  the  role  of  the  rate  of  interest  in 
the  dynamics  of  economic  time  series.  This  factor  has  been 
studied  since  the  time  of  Adam  Smith  and  David  Ricardo. 
Volumes  have  been  written  about  its  influence.  Irving  Fisher 
devoted  a  considerable  part  of  his  scientific  career  to  its  study. 
The  names  of  Walras,  Pareto,  Keynes,  Snyder,  Cassel,  and  many 
others  appear  in  the  bibliography  of  the  subject.  But  in  spite 
of  this  vast  study,  it  is  a  curious  matter  to  observe  that  one 
still  finds  it  very  difficult  to  measure  with  any  degree  of  exact- 
ness the  influence  upon  the  business  cycle  of  a  change,  such, 
for  example,  as  that  of  the  rediscount  rate  made  by  the  board 
of  the  Federal  Reserve  System. 

x)  "Differential  Equations  Subject  to  Error,  and  Population  Estimates," 
Journ.  Amev.  Statistical  Assn.,  Vol.  22   (1927),  pp.  283—414. 
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One  reason  for  this  difficulty  is  to  be  found  in  the  observation 
that  this  factor  seems  to  exert  its  influence  implicitly  through 
a  utility  function,  rather  than  as  an  explicit  variable.  One  knows 
from  abundant  experience  that  an  individual  under  the  duress 
of  severe  financial  need  will  agree  to  the  payment  of  usurous 
rates  of  interest,  but  if  he  is  in  the  possession  of  blue-chip 
collateral,  then  he  is  quite  willing  to  argue  the  rate  with  his 
banker.  What  is  true  for  the  individual  is  also  true  for  business. 
This  is  readily  observed  from  the  fluctuations  in  the  interest 
rates  on  4  to  6  months  commercial  paper,  which  has  varied  in 
this  century  from  7.37  %  in  1920  to  less  than  one  per  cent  in 
recent  years.  Thus  the  rate  of  interest  is  observed  to  be  a  function 
of  the  individual's  or  the  businessman's  money-utility. 

For  this  reason  it  has  seemed  to  the  author  that  some  light 
might  be  thrown  upon  the  problem  if  the  effect  of  the  rate  of 
interest  upon  economic  time  series  was  approached  strictly  from 
the  point  of  view  of  the  utility  function.  Fortunately  for  this 
investigation  a  very  promising  beginning  was  made  by  G. 
Tintner  in  1938,2)  who  introduced  the  interest  rate  into  the 
familiar  problem  of  the  maximization  of  utility  in  a  price- 
commodity  space,  subject  to  the  restraint  of  the  budget  equation. 
A  problem  proposed  by  him,  but  not  solved,  was  the  investigation 
of  second  order  conditions  to  establish  criteria  for  the  existence 
of  a  maximum  for  the  utility  function  thus  extended  to  include 
the  rate  of  interest.  The  model  suggested  for  such  an  investigation 
was  that  of  Hotelling  published  in  1935.3) 

In  the  present  paper  the  author  proposes  to  explore  the  further 
consequences  of  Tintner's  analysis  and  to  integrate  it  with  a 
dynamical  theory  of  prices  introduced  by  him  in  1941.4) 

2.  A  Dynamical   Theory  of  Prices  Which  Includes  the  Effect  of 
Interest 

We  shall  give  a  brief  summary  of  the  principal  result  obtained 

2)  "The  Maximization  of  Utility  over  Time,"  Econometrica,  Vol.  6  (1938) 
154—158. 

3)  "Demand  Functions  with  Limited  Budgets,"  Econometrica,  Vol.  3  (1935) 
pp.  71  ff. 

4)  Analysis  of  Economic  Time  Series,  (Bloomington,  Indiana,  1941),  pp. 
370—385. 
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by  Tintner.  Limiting  himself  to  three  quantities:  x,  y,  and  z, 
he  denoted  the  amount  of  each  consumed  at  time  tbyx(t),  y(t), 
and  z(t)  and  the  corresponding  prices  by  p(t),  q(t),  and  r(t) 
respectively.  He  then  introduced  a  utility  function,  which  we 
shall  denote  by  the  symbol  U,  and  a  force  of  interest  p(t).  The 
utility  was  then  maximized  subject  to  the  budget  restraint 

(2.1)  E  =  x(t)p(t)  +  y(t)q(t)  +  z(t)r(t). 
The  equilibrium  equations  then  assume  the  form 

u'      u'      u'  r  f        i 

(2.2)  -JL=  —1L  =  -JL  =dT  Jexp    -      p(s)ds 

V      ;  p(t)       q{t)       r{t)  Fl     J</W     J 

where  A  is  the  marginal  utility  of  money  at  time  t  =  0.  The 
novel  element  introduced  into  the  f amiliar  problem  is  the  function 
involving  the  force  of  interest. 

We  shall  now  introduce  this  result  into  a  dynamical  theory  of 
prices  which  the  author  proposed  some  years  ago.  The  arguments 
in  support  of  the  theory  have  been  given  elsewhere  and  will  not 
be  repeated  here.  Principal  support  for  the  theory  is  found  in  the 
empirical  evidence  of  its  effectiveness  in  describing  observed 
phenomena  associated  with  the  movement  of  prices,  in  particular, 
the  behavior  of  stock  prices,  including  the  bull  market  of  1929, 
the  subsequent  depression,  and  the  activity  of  the  series  in  the 
period  of  recovery.  But  the  introduction  of  the  factor  of  interest 
into  the  theory  as  proposed  here  is  not  for  the  purpose  of  giving 
further  support  to  the  theory  itself;  it  is  rather  to  see  what  the 
effect  would  be  when  such  an  introduction  is  made.  The  con- 
sequences seem  to  have  some  interest,  at  least  from  the  mathe- 
matical point  of  view,  whatever  one  may  think  of  their  actual 
economic  significance.  And  as  such  they  are  offered  to  the  reader. 

The  dynamical  theory  of  prices  is  based  upon  the  following 
integral  over  some  time  interval:  0  ^  t  ^  T: 


j>- 


(2.3)  /  =        (U  -kA-  niB)dt, 

where  U  is  the  utility  function,  A  is  a  measure  of  the  erratic 
element  in  the  economic  system,  and  B  is  a  measure  of  the  factor 
of  supply  and  demand.  The  quantities  k  and  m  are  constant 
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parameters  of  positive  sign.  A  realistic  definition  of  A  was  found 
to  be  the  sum  of  the  products  of  the  time-rates  of  change  of  goods 
consumed  by  the  time-rates  of  change  of  their  prices.  Thus,  in 
the  notation  used  above  for  the  three-commodity  system  of 
Tintner,  we  write: 

(2.4)  A  =  x'(t)f(t)  +  y'(t)q'(t)  +  z'{t)r'{t). 

Although  this  bilinear  form,  considered  only  mathematically, 
may  have  either  sign,  the  economic  argument  shows  that  it  is 
intrinsically  positive.  Hence  its  introduction  into  the  integral 
(2.3)  reduces  the  value  of  /. 

The  quantity  B  is  defined  to  be  half  the  sum  of  the  squares 
of  the  differences  between  the  quantities  of  good  consumed  and 
the  quantities  produced.  If  we  denote  by  u(t),  v(t),  and  w(t) 
the  quantities  of  x,  y,  and  z  produced,  we  write 

(2.5)  B  =  i[(x  -  uY  +(y-  vY  +  (a  -  w)*\ 

the  factor  \  being  introduced  only  for  convenience. 

Although  there  is  a  tendency  toward  equilibrium  between 
x,  y,  and  z  and  u,  v,  and  w,  their  differences  may  have  either  sign. 
If  we  define  u,  v,  and  w  as  current  inventories  at  time  t,  and  x, 
y,  and  z  as  the  amounts  currently  demanded,  then,  in  times  of 
depressed  business,  (x  —  u),  (y  —  v),  and  (z  —  w)  are  often 
negative;  but  when  business  is  active,  as  in  recent  years,  the  con- 
trary is  true.  The  value  of  B,  however,  is  never  negative  and 
when  this  factor  is  introduced  into  (2.3)  the  value  of  /  in  general 
is  reduced. 

The  primary  assumption  is  then  made  that  prices  are  deter- 
mined in  such  a  manner  that  the  integral  /  is  maximized  over 
time.  This  leads  us  to  the  following  three  equations: 

U:-kA'x-<mB'x=-k^A'j, 

(2.6)  U'v  -kA',-mB'v=-k^ A',, 

U'  -  kA'  -  mB'  =  -  k—A',. 

dt     z 
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If  we  now  introduce  the  values  for  U'x,  U'y,  and  U'z  from  (2.2) 
and  the  values  for  the  derivatives  of  A  and  B  from  (2.4)  and 
(2.5),  we  obtain  the  following  equations:  for  the  determination 
of  the  prices  p(t),  q(t),  and  r(t): 

d2p  r  •   C*  "1 

k-—  +  l  exp    —      p(s)ds  \p  =  m{x  —  u), 

(2.7)  k-jjL  +  A  exp  [-- j  p(s)rfsj?  =  w(y  -  w), 

d2r  r      f  1 

&-— +  Aexp    —      p(s)ds  \r  =  m{z  —  w). 

dt2  L     Jo  J 

Without  essential  loss  of  generality  we  can  assume  that 
k  =  1.  If  we  replace  the  right-hand  members  of  (2.7)  by  the 
symbols  E(t),  F(t),  and  G(t),  and  if  we  use  the  abbreviation: 
R(t)  =  exp  [—  Jlp(s)ds],  then  the  equations  can  be  written: 

dLt  +  XR(t)fi(t)=E(t), 

(2.8)  ^  +  mt)q(t)  =  F(t)t 


XR(t)r{t)  =  G(t). 


d2r 
~di? 

It  is  the  purpose  of  this  paper  to  explore  the  consequences 
of  this  system  of  equations  and  to  observe  the  difference  in  the 
behavior  of  prices  between  the  two  situations:  (1)  when  R(t) 
is  present  in  the  system;  and  (2)  when  R(t)  is  replaced  by  a 
constant. 

It  might  at  first  sight  appear  that  the  problem  as  it  is  for- 
mulated in  equations  (2.8)  is  oversimplified,  since  there  is  no 
explicit  intercorrelation  of  the  three  prices  as  is  usually  the 
case  in  real  markets  and  particularly  for  commodities  which 
are  competing.  Generality  can  be  restored  to  the  system,  however, 
by  adjoining  to  (2.8)  a  functional  connecting  x,  y,  and  z  and 
their  marginal  rates  of  substitution.  The  obvious  complexities 
which  are  introduced  will  be  avoided  in  this  paper  by 
assuming  independence  between  the  variables.  The  general 
problem  is  thus  reduced  essentially  to  the  case  of  a  single  market. 
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3.  Behavior  of  Prices  Under  a  Constant  Positive  Force  of  Interest 
—  Integration  of  the  Differential  Equation. 

We  shall  begin  with  a  simple  market  involving  one  commodity 
and  its  price  as  defined  by  the  first  equation  in  system  (2.8), 
that  is  to  say,  with  the  equation: 

(3.1)  ££  +  XR(t)p{t)  =  B{t). 

Let  us  assume  that  the  force  of  interest  is  a  positive  constant 
and  equal  to  p,  from  which  we  get  R(t)  =  e~pt.  Equation  (3.1) 
then  assumes  the  form 

(3.2)  ?±  +  ze-Ptp{t)  =  E(t). 

The  first  problem  suggested  by  this  equation  is  to  find  the 
behavior  of  the  price  when  E(t)  is  zero,  that  is  to  say,  when 
the  supply  factor  equals  the  demand  factor.  We  are  thus  led 
to  the  solution  of  the  equation 

d2P 

(3.3)  _  +  Ae-^P(*)=-0, 

where  P(t)  is  used  to  denote  the  equilibrium  price. 

This  equation  can  be  integrated  in  terms  of  Bessel  functions. 
To  accomplish  this  we  assume  a  new  independent  variable 


(3 

•4) 

z 

2Vk 
=  — 1 

P 

Q-$P*, 

in 

terms  of  which  we  compute 

dP 

~di  ~ 

,     dP 

d*P 

~a¥~~ 

4/)    Z 

I  dtp    dp\ 


When  these  derivatives  are  substituted  in   (3.3)  and  obvious 
simplifications  made,  the  following  equation  is  obtained: 

,      v  d2P      dP 

(3'5)  Z^  +  lz-  +  ZP  =  °> 

(A/A  Uf4> 

which  is  recognized  as  the  simplest  case  of  Bessel's  differential 
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equation.  Its  general  solution  is  the  function 

(3-6)  P  =  CJ0(z)  +  DY0(z), 

where  J0(z)  and  Y0(z)  are  respectively  the  Bessel  functions  of 
first  and  second  kind  of  order  zero. 

The  general  solution  of  equation  (3.3)  thus  assumes  the  form: 

(3.7)         P(t)  =  CJ,  (^e-*)  +  DY0  (~^>j  ■ 

In  order  to  interpret  this  equation,  we  first  observe  that 
as  t  ->  oo,  we  have 

lim  P(t)  =  C/0(0)  +  DY9(0). 

But  since  /0(0)  =  1  and  Y0(0)  =  -oo  we  must  set  D  =  0  in 
order  to  obtain  a  realistic  description  of  the  equilibrium  price. 
We  are  thus  led  to  a  consideration  of  the  function 

((3.8)  P(t)  =  CJ0(Ke-i»'), 

where  we  use  the  abbreviation:  K  =  2^/X/p. 

As  is  well  known  the  function  J0{z)  has  the  following  asymp- 
totic approximation  provided  z  is  sufficiently  large: 

|3...9)  J0(z)  ~  ]/—  cos  (z  -  in). 

f  nz 

S.ubstituting  Ke~$pt  for  z  in   this  formula,   we   obtain   the 
if  olio  wing  asymptotic  value  for  P(t): 


[(S.m) 


Pit)  ~  C  1/-4-  e*"'  cos  (K  e-^  -  In] 
*  nK 


which  bolds  only  when  the  argument  is  large. 

But  if  we  assume  that  t  —  0  is  the  origin  of  our  time-scale, 
then  the  largest  value  that  z  can  have  is  K.  Since  K  ->  oo  as 
p  ->  0,,  it  is  clear  that  large  values  of  the  argument  are  to  be 
•obtained  only  when  p  is  small. 

We  can  now  formulate  a  few  of  the  characteristics  of  P(t) 
as  follows: 

(a)  If  K  is  sufficiently  large,  that  is  to  say,  if  p  is  small,  then 
P(t)  is  an  .oscillating  function  in  the  neighborhood  of  the  origin. 
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(b)  As  t  increases  the  number  of  zeros  within  any  fixed  interval 
decreases  as  t  increases  and  this  number  ultimately  reduces 
to  zero. 

(c)  The  relative  maxima  of  P(t)  increase  as  t  increases. 

(d)  The  distance  between  relative  maxima  increases  as  t 
increases. 

(e)  P(t)  finally  approaches  C  asymptotically  from  below. 

4.  Behavior  of  Prices  When  the  Force  of  Interest  is  Very  Small. 

A  question  of  considerable  analytical  interest  is  invoked  when  p 
is  zero  or  very  small.  If  p  is  actually  zero,  then  equation  (3.3) 
reduces  to  the  following: 

d2P 

(4.1)  _  +  AP(0=0, 

which  has  the  solution: 

(4.2)  P(t)  =  a  sin  (VJt  +  b). 

But  is  it  very  interesting  to  observe  that  P(t)  as  defined  by 
(3.8)  does  not  reduce  to  P(t)  as  given  by  (4.2)  when  p  ->  0, 
since  K  ->  oo,  and  the  function  approaches  zero  for  all  values 
of  t. 

It  is  important,  however,  to  have  a  solution  of  equation  (3.3) 
when  p  is  small,  that  is  to  say,  a  solution  intermediate  between 

(4.2)  and  (3.8).  Such  a  solution  can  be  developed  formally  as  a 
power  series  in  />,  that  is,  in  the  form: 

(4.3)  P^t)  =  a  sin  (Vlt  +  b)  +  p^(t)  +  P2</>2(t)  +  .  .  .. 

Since  secular  terms,  that  is  to  say,  terms  of  the  form  tn  sin  Vx  t 
and  tn  cos  \fk  t,  are  thus  introduced,  one  has  reservations  about 
the  convergence  of  the  series.  But  it  is  not  difficult  to  show  that 
for  a  limited  range  of  t  and  for  p  sufficiently  small  the  difference 
between  P(t)  as  defined  by  (3.8)  and  the  first  two  terms  of  P0(t) 
can  be  made  arbitrarily  small. 

After  some  effort  the  value  of  ^(t)  was  finally  obtained 
and  PQ(t)  written  in  the  following  somewhat  more  convenient 
form: 
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(4.4; 


P0(t)  =  aQ    cos  crt  -j;  —  (—sin  atf-fotf  cosa/+a^2  sin  otf)  +  . . . 
+  a1    sin  aH (^  sin  vi  —  a/2  cos  aj)  +  •  •  •    » 


where  a0  and  <%  are  arbitrary  constants  and  a2  =  X. 

In  order  to  compare  P(t)  as  defined  by  (3.8)  with  P0(t),  we 
equate  both  functions  and  their  first  derivatives  at  t  =  0.  We 
thus  find 

(4.5)  a0  =  CJ0(K),     a1  =  CJ1(K)} 

where  Jx(z)  is  the  Bessel  function  of  first  order. 

The  magnitude  of  the  error  \P(t)  —  P0(t)\  is  shown  by  the 
following  numerical  approximations: 

Setting  C  =  100,  X  ==.  1,  p  =  0.1,  t  =  2,  we  compute: 

P(*)  =  100/0(18.09675)  =  0.48167,  P0(*)  =  0.49995, 

which  is  quite  satisfactory,  considering  the  comparatively  large 
value  of  p,  namely,  a  force  of  interest  equal  to  10  %. 

5.  Behavior  of  Prices  When  the  Force  of  Interest  is  Negative. 

There  is  nothing  inherent  in  the  theory  of  Tintner  to  prevent 
us  from  assuming  that  the  force  of  interest  in  R(t)  is  negative. 
That  is  to  say,  we  can  write:  R(t)  =  ept,  p  >  0,  and  thus  replace 
equation  (3.3)  by  the  following: 

(5.1)  ^+Xe"'7c(t)  =  0, 

where  n{t)  denotes  the  price  at  time  t. 

The  meaning  to  be  attached  to  this  assumption  is  readily 
seen  from  Tintner's  fundamental  equation,  namely 

(5.2)  7,  +  5,^(1  +  ^-i)  =  E,+  s,<         /  =  1,  2,  .  .  .,  n, 

sn  =  0,  where  Ii  is  the  expected  money  income,  Ej  the  expected 
expenditure,  and  s,  the  expected  savings  at  time  /,  and  sp&  the 
savings  planned  for  time  j  —  1.  These  savings  are  accumulated 
at  an  anticipated  rate  of  interest  *,_.,.  But  as  one  well  knows 
savings  are  not  always  invested  at  a  profit,  in  which  case  i^_,x  is 
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negative.  The  theory  leads  to  the  interesting  conclusion  that  the 
anticipated  marginal  utility  of  money  at  time  /  —  1  may  be 
written 

(5.3)  X  =  XJ[(l  +  H)(l  +  H)  .  .  .  (1  +  Vi)L 

where  Xx  is  the  observed  marginal  utility  at  the  origin  of  the  time 
sequence. 

Assuming  that  the  rates  of  interest  are  positive  over  a  given 
period  of  time,  then  (5.3)  can  be  replaced  by  the  marginal 
utility:  X  =  X1e~pt;  but  in  the  contrary  case,  if  the  rates  of 
interest  are  negative,  then  the  utility  becomes:  X  =  X1ept. 

Before  considering  the  actual  economic  consequences  of  this 
assumption,  let  us  first  consider  the  solution  of  equation  (5.1). 
Observing  that  J$(—z)  =  J0(z),  we  obtain  the  desired  solution 
by  replacing  p  by  —  p  in  equation  (3.8).  We  thus  have 

(5.4)  n(t)  =  CJ0(Keipi), 

where  K  =  2VX/p  and  C  is  an  arbitrary  constant. 

From  the  properties  of  the  function  J0(z)  we  readily  obtain 
the  following  characteristics  of  n(t): 

(a)  7t(t)  declines  in  an  oscillatory  manner  from  an  initial  value 
7t  (t0)  and  the  frequency  of  its  oscillations  increases  as  t  in- 
creases. 

(b)  The  maxima  of  n{t)   are   asymptotic  to   the   function: 

CVyjiKe-**. 

(c)  7i  if)  approaches  zero  as  t  ->  oo. 

6.  Graphical  Representation  of  the  Functions. 

Before  considering  the  economic  realities  of  the  theory  which 
we  have  described,  let  us  examine  the  graphical  representation 
of  the  functions  for  a  few  special  values  of  the  parameters. 

Case  1.  If  p>0,  then  we  have:  P(t)  =  CJ0{Ke-^f),  where 
K  —  2y/J.jp.  Let  us  introduce  arbitrarily  the  values:  C  =  10, 
X  —  0.09,  p  =  0.04,  whence  K  =  15.  This  function  we  shall 
consider  over  the  range:  0  ^  t  ^  150,  that  is  to  say,  we  shall 
evaluate  J0  (z)  between  *  =  15  and  z  =  0.75,  where  z  =  15  e_0-02i. 
Values  of  P(t)  over  this  range  are  contained  in  the  following 
table: 
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t 

z 

P(t) 

t 

z 

P(t) 

0 

15.0 

-0.14 

60 

4.5 

-3.21 

10 

12.3 

-1.11 

70 

3.8 

-4.01 

20 

10.0 

-2.46 

80 

3.0 

-2.60 

30 

8.3 

1.09 

100 

2.0 

2.07 

40 

6.8 

2.89 

125 

1.23 

6.53 

50 

5.5 

0.10 

150 

0.75 

8.64 

The  maxima,  minima,  and  zero  values  of  P(t)  over  the  given 
range  are  contained  in  the  following  table: 


t 

z 

P(t) 

/ 

z 

P(t) 

0  + 

14.93 

0 

38.0 

7.02 

3.00 

6.1 

13.32 

2.18 

50.0 

5.52 

0 

12.0 

11.79 

0 

68.5 

3.83 

-4.03 

19.3 

10.17 

-2.50 

91.5 

2.40 

0 

27.5 

8.65 

0 

From  these  values  the  graph  of  P(t)  is  now  constructed  between 
t  =  0  and  t  =  150  as  shown  in  Figure  1.  In  this  same  figure 
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Fig.   1. 
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there  are  shown  also  the  graphs  corresponding  to  p  =  0.03  and 
p  =  0.06,  which  were  similarly  computed. 

We  observe  from  these  figures  that  P(t)  for  positive  values 
of  p  oscillates  about  the  base  line,  but  ultimately  approaches 
asymptotically  the  line  P  =  10.  We  also  see  that  this  value  is 
approached  more  rapidly  the  larger  p  is  taken  and  that  at  the 
same  time  the  number  of  oscillations  diminishes.  It  is  also  worthy 
of  note  that  as  p  approaches  zero  the  curve  tends  towards  a  sine 
curve,  but  as  the  number  of  oscillations  increases  the  amplitude 
of  the  maximum  values  diminishes. 

Case  2.  If  p  <  0,  then  we  are  concerned  with  P(t)  =  CJ0(Ke%pt). 
Introducing  the  arbitrary  values:  C  =  10,  y/X  =  0.02,  p  =  0.04, 
we  consider  this  function  over  the  range:  0  ^  t  ^  150.  In  this 
case  P(t)  decreases  in  a  sinusoidal  pattern  from  an  arbitrary 
initial  value  of  10  /0(1)  =  7.65  to  a  limiting  value  of  0.  This 
curve  is  shown  in  Figure  2,  in  which  is  also  exhibited  the  graph 
corresponding  to  p  =  0.06. 

,ot:P(t) 


Fig.  2. 

We  now  observe  that  the  number  of  oscillations  increases  as 
p  is  increased  and  that  the  amplitudes  of  the  maximum  values 
diminish  more  rapidly.  The  curve  which  is  asymptotic  to  these 
maxima,  for  p  =  0.06,  is  also  shown  in  the  figure.  Its  equation 
is  readily  found  to  be  the  following: 

2~ 


y 
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7.  Economic  Applicability  of  the  Theory. 

It  not  infrequently  happens  that  a  pleasing  mathematical 
formulation  of  an  economic  problem  may  actually  have  little 
application  to  the  phenomena  which  is  proposes  to  explain. 
Either  the  parameters  may  resist  statistical  estimation;  or,  if 
they  can  be  determined,  the  real  pattern  and  the  mathematical 
pattern  may  be  at  variance  with  one  another.  It  is  thus  a  matter 
of  some  importance  to  put  any  set  of  equations  to  the  test  of 
reality. 

In  the  present  case  we  can  find  at  least  one  period  where  the 
present  theory  appears  to  have  applicability,  namely,  the  years 
between  1917  and  1935.  Within  these  dates  the  movements  of 
the  business  cycle  were  essentially  those  of  a  free  economy, 
which  had  been  stimulated  by  the  war  economy  that  had  preceded 
it  and  which  had  not  yet  been  affected  by  the  intrusion  of 
government  spending,  high  income  taxes,  and  other  regulations 
that  characterized  the  subsequent  period.  Large  variations  both 
in  the  index  of  prices  and  in  the  rate  of  interest  are  to  be  observed. 
The  rise  and  fall  of  stock  prices,  a  balanced  national  budget, 
and  a  decline  in  the  national  debt  were  also  characteristic 
features  of  these  years. 

But  the  reason  why  this  period  is  particularly  adapted  to  the 
present  study  is  found  in  the  observation  that  the  principal 
criterion  of  Tintner's  theory,  the  final  disappearance  of  savings, 
is  satisfied.  To  show  this  we  shall  assume  that  in  1917  a  capital 
fund  of  $  100,000  is  invested  at  the  current  rate  of  interest, 
ij.  While  the  rate  of  interest  may  have  numerous  definitions,  it 
will  be  convenient  to  assume  this  rate  to  be  that  of  4  to  6  months 
commercial  paper.  The  income  thus  derived  in  the  j-th  period 
is  Ij.  From  this  income  an  expenditure  is  taken  which  we  shall 
assume  has  a  fixed  purchasing  value  equivalent  to  $  4,000  in 
1917.  Thus  Ej  =  $  4,000  Pjf  where  Pj  is  an  appropriate  price 
index  with  1917  as  base.  The  general  price  index  of  the  Federal 
Reserve  Bank  of  New  York  City  seemed  to  be  quite  suitable 
for  his  purpose. 

In  order  to  bring  Tintner's  theory  within  the  scope  of  this 
paper  it  is  convenient  to  introduce  an  average  rate  of  interest, 
iA,  in  terms  of  which  we  can  then  write  equation  (5.2)  in  the 
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following  equivalent  form: 


(7.1) 


h 


'}  +  Pi)Sj-i  =  Et 


+  s„ 


where  we  have:  Ij  =  C  ii  -\-sj_1iA,  C  =  the  initial  capital,  and 
Pj  =  ii  —  iA .  As  we  have  assumed  above,  C  —  $  100,000.  The 
average  rate  of  interest,  computed  from  the  formula:  (1  -f-  iA)n  = 
(1  -f  *x)(l  +  H)  •••(!+  in),  was  found  to  be  4.16  %.  With 
these  values  the  following  schedule  was  then  computed: 


Year 

Capital 
fund 

Price  Index 
1913=1.00 

Interest 
rate 

Pi 

h 

(l  +  Pi)*i-i 

E>- 

5; 

1917 

$100,000 

1.39 

4.74o/0 

0.58% 

$4740 

$  o 

$4000 

$  740 

1918 

100,740 

1.57 

5.87 

1.71 

5901 

753 

4520 

2133 

1919 

102,133 

1.73 

5.42 

1.26 

5509 

2160 

4960 

2709 

1920 

102,709 

1.93 

7.37 

3.21 

7483 

2796 

5560 

4719 

1921 

104,719 

1.63 

6.53 

2.37 

6726 

4831 

4680 

6877 

1922 

106,877 

1.58 

4.43 

0.27 

4716 

6896 

4560 

7052 

1923 

107,052 

1.65 

4.98 

0.82 

5273 

7110 

4760 

7623 

1924 

107,623 

1.66 

3.91 

-0.25 

4227 

7604 

4760 

7071 

1925 

107,071 

1.70 

4.03 

-0.13 

4324 

7062 

4880 

6506 

1926 

106,506 

1.71 

4.24 

0-08 

4511 

6511 

4920 

6102 

1927 

106,102 

1.71 

4.01 

-0.15 

4264 

6093 

4920 

5437 

1928 

105,437 

1.76 

4.84 

0.68 

5066 

5474 

5080 

5460 

1929 

105,460 

1.79 

5.78 

1.62 

6007 

5548 

5160 

6396 

1930 

106,396 

1.68 

3.56 

-0.60 

3826 

6358 

4840 

5344 

1931 

105,344 

1.50 

2.64 

-1.52 

2896 

5263 

4320 

3805 

1932 

103,805 

1.32 

2.84 

-1.32 

2998 

3755 

3800 

2953 

1933 

102,953 

1.29 

1.87 

-2.29 

1993 

2885 

3720 

1158 

1934 

101,158 

1.37 

1.14 

-3.02 

1188 

1123 

3960 

-1649 

1935 

98,351 

1.45 

0.91 

-3.25 

— 

— 

— 

— 

It  will  be  clear  both  from  this  table  and  from  the  graphs 
shown  in  Figure  3  that  there  is  a  general  tendency  in  this  period 
for  interest  rates  and  prices  to  move  together.  We  shall  now 
attempt  to  show  that  the  dynamical  theory  which  we  have 
introduced  is  in  satisfactory  agreement  with  the  observed 
behavior  of  prices. 

We  first  observe  that  the  force  of  interest  varies  from  a 
maximum  of  3.21  %  to  a  minimum  of  —3.25  %.  Between  1917 
and  1923  inclusive  it  has  an  average  value  of  1.45;  between  1924 
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and  1929  its  average  is  0.029  and  between  1930  and  1935  its 
average  is  —1.75.  Corresponding  to  each  of  these  periods  we  shall 
now  assume  the  following  values  of  p  :  px  =  0.0145,  p2  =  0.0029, 
and  p3  =  —0.0175. 

(b)    (a) I- 

190        1 


150       -  1 


140       -2 


130      -3 


(a)  Force  of  interest 
(P  =  i-'A) 

(b)  Price  Index  (P);  1913  =  100 
(o)  Prices  computed  from   theory. 


Fig.  3 


If  we  now  denote  by  Px  the  variation  in  price  in  the  first  period 
from  an  average  value  of  1.60,  then  the  theory  states  that  this 
variation  can  be  represented  approximately  by  the  function: 

(7.2)  Py  =  C1J9(-e-J***), 

and  similarly  for  the  third  period  by 
(7-3) 


where  t  and  r'  are  variables  measuring  time  from  an  initial 
value  of  zero. 

Since  the  marginal  utility  of  money  has  presumably  not 
changed  throughout  the  time  interval,  the  constant  k  is  the 
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same  in  each  function.  To  identify  the  initial  value  of  Px  with 
the  observed  value  of  the  actual  price  index,  we  let  Cx  =  1  and 
k/Pl  =  10.  It  then  follows  that  k/\p3\  =  l0-{pj\pji)  =  8.29. 

Since  both  r  and  t'  are  variables  of  time,  but  in  unspecified 
units,  it  is  clear  that  if  time  (t)  is  measured  in  some  standard 
unit  such  as  months,  we  must  write:  t  =  at  -f  b,  r'  =  a't  +  b', 
where  a,  b,  and  a',  b'  are  constants.  In  fitting  (7.2)  and  (7.3) 
to  the  observed  data,  where  t  in  months  varies  from  0  to  204, 
we  find  the  following  transformations  are  necessary:  t  =  0.862^, 
and  t'  =  0.354*  +  0.413.  Introducing  these  values  respectively 
into  (7.2)  and  (7.3),  and  determining  C2  =  2  from  the  observed 
data,  we  obtain  the  following  functions  as  approximations  to 
the  series  of  price  variations  within  the  two  periods: 

(7.4)       Pt  =  /0(10  e-°-00624*),  P3  =  2/0(12.53  e°-0031<). 

In  the  second  period,  since  p2  is  very  small,  a  sine  curve  is 
fitted  to  the  data.  A  satisfactory  representation  is  obtained  from 
the  following  function: 
,(7.5)  P2  =  sin  (0.00229*  -  0.137). 

The  excellence  of  the  approximation  thus  achieved  by  (7.4) 
and  (7.5)  is  readily  seen  from  Figure  3,  where  the  circles  represent 
the  values  computed  from  these  functions. 

8.  Behavior  of  Prices  Under  a  Constant  Force  of  Interest  and  an 

Impressed  Force. 

During  the  period  which  we  have  just  analyzed  in  Section  7 
there  is  to  be  observed  another  price  series  which  does  not 
conform  at  all  to  the  pattern  of  general  prices.  We  refer  to  the 
price  of  industrial  stocks  as  measured  by  the  Dow  Jones  averages. 
The  great  difference  between  the  two  series  is  readily  shown, 
for  example,  in  the  period  between  1923  and  1929  when  the 
Dow  Jones  average  increased  from  100  to  a  maximum  of  340, 
while  the  general  price  index  changed  but  little.  A  similar 
spectacular  difference  is  also  to  be  observed  in  the  period  between 
1950  and  1957,  when  stock  prices,  as  measured  by  the  Dow  Jones 
averages,  increased  from  a  level  of  200  to  a  maximum  well  over 
500.  During  this  period  general  prices  showed  a  modest  increase 
of  approximately  15%. 
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The  original  intention  of  the  author's  theory  of  price  dynamics 
was  to  explain,  first,  an  observed  cyclical  pattern  in  stock  prices 
and,  second,  the  abnormal  rises  which  we  have  just  described. 
Considerable  success,  measured  in  terms  of  the  statistical 
agreement  between  theory  and  observation,  has  attended  this 
explanation.  Since  both  of  the  periods  have  been  characterized 
by  an  abnormal  business  activity  and  since  the  rise  in  stock 
prices  appears  to  be  a  direct  concomitant  of  it,  one  is  led  naturally 
to  seek  a  mathematical  explanation  in  the  non-homogeneous 
equation: 

(8.1)  d^t+Xe-Ptp(t)=.E(t). 

where  E(t)  is  a  measure  of  industrial  activity. 

In  the  author's  original  study,  the  effect  of  the  interest  rate 
was  not  included.  In  spite  of  this  omission  the  dominating 
influence  of  the  force  function,  E  (t) ,  made  possible  a  satisfactory 
explanation  of  the  explosive  prices  of  the  bull  market  through 
the  phenomenon  of  resonance.  But  the  long  cycles  observed  in 
the  movement  of  stock  prices  after  the  collapse  of  the  bull 
market  were  not  adequately  accounted  for  by  the  restricted  theory 
except  by  the  introduction  of  certain  ad  hoc  hypotheses.  It 
seems  highly  probable  that  this  behavior  can  be  accounted  for 
in  a  satisfactory  manner  by  the  introduction  of  the  force  of 
interest. 

The  most  conspicuous  factor  that  was  neglected  in  the  earlier 
study  was  the  influence  of  the  rediscount  rate  of  the  Federal 
Reserve  Board  upon  the  action  of  stock  prices.  This  controversial 
matter  —  the  actual  effect  of  a  change  in  the  rediscount  rate  — 
has  been  difficult  to  appraise  satisfactorily.  Biut  it  seems  quite 
possible  that  equation  (8.1)  may  provide  a  new  and  more 
satisfactory  approach  to  the  problem.  This  can  be  achieved  by 
making  a  comparison  between  the  movement  of  prices  for  a 
given  observed  E(t)  when  p  is  set  equal  to  zero,  and  when  it  is 
given  a  positive  or  a  negative  value.  Unfortunately  the  scope  of 
the  present  paper  precludes  an  adequate  statistical  study  of 
this  engaging  problem,  but  at  least  its  mathematical  formulation 
is  possible. 
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Let  us  begin  by  introducing  the  transformation  of  the  in- 
dependent variable  t  to  the  variable  z  by  means  of  equation  (3.4). 
We  thus  replace  (8.1)  by  the  following: 

d2p         dp  4 

(8.2)  z2  — -  +  z  —  -\-z2p  =  —  E1(z), 
y     ]  dz2  dz F      p2     1W 

where  we  use  the  abbreviation: 

(8.3)  Ex(z)  =  E  [-  log  (^)]  ,     K  =  2Vi/p. 

The  solution  of  equation  (8.2)  can  be  written  in  the  form: 

(8.4)  p(z)  =  AJ0(z)  +  BY0(z) 

$71 

+  ~i 


■  Cz  E  (s) 

J    Uo(s)Y,(z)  -  J0(z)Y0(s)}  -^  ds, 


where  /0(^)  and  Y0(z)  are  respectively  Bessel  functions  of  first 
and  second  kind  of  order  zero.  The  quantities  A  and  B  are 
arbitrary  constants. 

If  p  =  0,  then  (8.4)  is  to  be  replaced  by  the  following: 

1  f< 

(8.5)        p(t)  =  A  sin  cut  +  B  cos  oit  -\ sin  a (t  —  s)E(s)ds, 

a  Jo 

where  A  and  B  are  arbitrary  constants  and  a2  =  X.  As  in  the 
homogeneous  case,  there  appears  to  be  no  obvious  way  by 
means  of  which  we  can  pass  from  (8.4)  to  (8.5)  by  a  limiting 
process  when  p  ->  0. 

Returning  to  equation  (8.4),  we  see  that  it  is  possible  to  find 
the  price  which  corresponds  to  a  given  impressed  force,  but 
there  are  obvious  difficulties  of  integration  which  make  the 
determination  of  price  by  this  means  impractical  in  general. 
Except  for  a  few  cases,  which  are  unrealistic  in  the  present 
problem,  the  evaluation  of  the  right  hand  member  of  (8.4)  must 
be  achieved  by  the  methods  of  mechanical  integration. 

It  should  be  observed,  however,  that  if  an  observed  price  is 
given,  the  problem  of  finding  the  impressed  force  which  generates 
it,  is  much  easier  to  solve.  In  this  case,  we  can  regard  both  (8.4) 
and  (8.5)  as  Volterra  integral  equations,  from  the  inversion  of 
which  the  desired  function  is  obtained.  Since,  moreover,  the 
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differential  equivalents  of  the  two  equations  are  known,  the 
solution  of  the  problem  is  readily  obtained. 

Thus,  the  solution  of  equation  (8.1)  can  be  written: 

(8.6)  p(t)  =  P0(t)  +  Q(t), 

where  P0{t)  is  the  price  when  E(t)  =  0,  and  Q(t)  is  the  part  of 
price  which  is  generated  by  the  force  function.  If,  then,  we 
denote  by  L(u)  the  operator 

(8.7)  £(„)  =  _  + a  e->««(0. 

equation  (8.1)  can  be  written  in  the  form: 

(8.8)  E(t)  =  L(P  +  Q)=  L(P)  +  L(Q)  =  L(Q). 

Therefore,  given  an  observed  price  series,  it  is  possible  to 
obtain  from  (8.8)  the  market,  which  would  lead  to  such  a  series. 
A  few  examples  are  given  below  as  follows: 

Case  1.  p  =  P  -f  K,  where  K  is  a  constant.  E(t)  =  K  e~pt. 
In  this  case,  the  market  starts  with  a  demand  in  excess  of  supply, 
but  equality  between  demand  and  supply  is  attained  exponential- 
ly.  (Figure  4). 

Case  1 

Q(t) 


E(t) 


INFLUENCE  OF  INTEREST  RATES  ON  TIME  SERIES  OF  PRICE 


161 


Case  2.  p  =  P  +  A  +  Bt  +  Ct2,  where  A,  B,  and  C  are 
positive  constants.  E(t)  =  2C  +  Xe-P*(A  +  Bt  +  Ct2).  In  this 
case  the  market  starts  with  a  demand  in  excess  of  supply, 
which  ultimately  reaches  a  point  of  constant  equilibrium  in 
which  the  difference  between  the  quantities  demanded  and  those 
supplied  is  constant.  Prices  then  increase  according  to  a  quadratic 
law  and  the  effect  of  the  interest  rate  is  finally  absorbed.  (Fi- 
gure 4). 

Case  3.  p  =  P  +  A  +  Bt,  A,  B  >  0.  E(t)  =  X  e-*>*(A  +  Bt). 
In  this  case  the  market  starts  with  a  demand  in  excess  of  supply, 
which,  after  rising  to  a  maximum  value,  decreases  exponentially 
until  equality  is  approximated  between  the  two  factors.  Prices 
rise  linearly  and  ultimately  absorb  the  effects  of  the  interest 
rate.   (Figure  5). 


Case  4.  p  =  P  +  A  +  Bt,  A  >  0,  B  <  0.  E(t)  =  X  e^* 
(A  +  Bt).  As  in  Case  3,  the  market  starts  with  demand  in  excess 
of  supply,  but  the  demand  at  once  declines  exponentially  until 
equality  is  reached.  Thereafter,  the  supply  is  slightly  in  excess 
of  demand,   but  the  difference  diminishes  asymptotically  to 
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zero.  Prices  themselves  decline  linearly  until  the  effect  of  the 
interest  rate  is  finally  absorbed.   (Figure  5). 

Case  5.  j>  =  P  +  K  +  A  sin  (kt  +  c),  K  >  0.  E(t)  =  -Ak2 
sin  (kt  -f-  c)  -\-  K  -\-  A  sin  (kt  +  c)  e~pt.  In  this  case  the  market, 
which  starts  with  a  sinusoidal  movement  about  a  constant 
positive  demand,  finally  damps  into  a  pure  sinusoidal  movement 
in  which  there  is  at  one  time  an  over  supply  and  in  another 
excess  demand. 

Case  6.  (p  =  0).  p  =  —(A  fiat.)  t  cos  at,  a2  =  L  E(t)  =  ^ 
sin  a£.  In  this  case  we  have  an  example  of  the  effect  of  resonance. 
A  sinusoidal  market,  which  has  the  same  period  as  that  of  the 
free  system,  generates  as  explosive  price.  The  bull  market  of 
1929  is  readily  explained  by  this  phenomenon,  as  has  been 
indicated  earlier  in  this  paper. 

Something  akin  to  this  situation  is  also  observed  when  p  is 
positive.  Thus  if  we  assume  that  p  =  tj0(z),  where  z  is  defined 
by  (3.4),  then  we  find  that  E(t)  =  pzj^z),  where  Jx(z)  is  the 
Bessel  function  of  first  order.  At  t  increases  z  approaches  zero 
and  J0(z)  approaches  unity.  Hence  p  increases  without  limit. 
But  as  z  approaches  zero,  Jx(z)  also  approaches  zero  and  E(t) 
is  ultimately  zero.  We  thus  have  an  explosive  price  with  a  force 
function  that  decreases  asymptotically  to  zero. 


PART  IV 

On  the  Edgeworth  Taxation  Paradox 


WILLIAM  VICKREY 


Can  Excises  Lower  Prices? 


In  the  land  of  ceteris  paribus,  that  kindergarten  of  economic 
theory  where  one  thing  changes  at  a  time  while  all  else  remains 
frozen  on  the  spot  as  in  a  game  of  "redlight,"  it  is  a  commonplace 
that  if  a  tax  is  placed  upon  an  article,  its  price  to  the  consumer 
will  be  raised,  or  will  at  most  remain  the  same.  This  proposition, 
true  enough  when  "other  things"  are  "equal",  in  some  sense, 
has  quite  generally  been  supposed  to  apply  also  in  the  real  world 
where  "ceteris"  are  notoriously  reluctant  to  remain  "paribus". 
To  be  sure,  it  is  difficult,  offhand,  to  produce  an  example  from 
real  life  that  can  confidently  be  said  to  controvert  this  prop- 
osition. But  the  complacency  with  which  economists  had  been 
extending  their  results  from  the  partial  equilibrium  world  of 
ceteris  paribus  to  the  real  world  of  complex  interrelations  was, 
for  the  more  sophisticated  of  them,  rather  rudely  shocked  when 
Edgeworth,  in  1897,  managed  to  produce  an  apparently  reasonable 
and  consistent  example  of  a  case  where  a  tax  on  a  certain  class 
of  railway  ticket  could  induce  a  profit-maximizing  monopolist 
to  reduce  the  rate  for  all  classes  of  tickets,  inclusive  of  tax.1) 

However,  the  effect  of  this  paradox  on  economic  thinking  has 
been  considerably  less  devastating  than  might  have  been 
expected.  The  model  of  necessity  involved  considerable  mathe- 
matics, and  those  outside  the  inner  circle  of  economists  versed 
in  mathematics  were  prone  to  suspect  some  piece  of  legerdemain 
concealed  in  the  works.  Furthermore  the  example  involved 
monopoly,  and  at  that,  a  type  of  monopoly  generally  subjected 
to  public  control  rather  than  left  to  maximize  its  profit  as  best 
it  might,  so  that  even  those  few  economists  who  were  familiar 

x)  F.  Y.  Edgeworth,  "La  Teoria  Pura  del  Monopolio,"  Giornale  degli  Eco- 
nomist, Vol.  15  (October,  1897),  pp.  307—320,  esp.  p.  317.  "The  Pure  Theory 
of  Taxation",  Economic  Journal,  VII  (June,  1897)  231;  reprinted  in  Papers 
Relating  to  Political  Economy   (London,   1925),  I,   132,   143;  II,  93,  401. 
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with  the  paradox  were  inclined  to  treat  the  example  as  a  curio- 
sum,  not  likely  to  have  any  real  life  counterpart.  The  reaction 
of  Seligman  is  perhaps  typical:  "The  mathematics  which  can 
show  that  the  result  of  a  tax  is  to  cheapen  the  untaxed  as 
well  as  the  taxed  commodities  will  surely  be  a  grateful  boon  to 
the  perplexed  and  weary  secretaries  of  the  Treasury  and  ministers 
of  finance  throughout  the  world!"2) 

It  remained  for  Harold  Hotelling  to  show,  in  1932,  that  the 
phenomenon  was  also  possible  under  conditions  of  perfect 
competition;  but  again  the  rather  formidable  looking  mathe- 
matics involved  tends  to  scare  away  most  economists.  All  that 
Hotelling  says  in  non-mathematical  terms  about  the  conditions 
that  are  required,  if  a  tax  levied  on  one  of  two  related  commod- 
ities is  to  lower  both  their  prices,  is  that  the  two  commodities 
must  compete  both  in  production  and  in  consumption.  But 
obviously  this  is  not  enough  of  itself  to  produce  the  paradox, 
and  we  cannot  say  that  the  phenomenon  is  likely  merely  because 
a  large  number  of  commodities  satisfy  these  conditions.  Some- 
thing more  is  obviously  required.  It  is  the  purpose  of  this  article 
to  set  forth  with  the  aid  of  as  elementary  mathematics  as  is 
possible  just  what  that  something  more  is. 

Unfortunately  there  seems  to  be  no  way  to  avoid  some 
recourse  to  mathematics.  But  I  will  try  to  come  out  with 
something  more  than  a  mere  set  of  equations.  To  simplify  the 
discussion  as  much  as  possible,  it  will  be  assumed  that  we  are 
dealing  with  only  the  two-commodity  case  under  conditions  of 
perfect  competition.  That  is,  we  will  consider  only  the  case 
where  the  paradox  can  arise  through  the  interrelations  between 
only  two  commodities,  and  cases  involving  longer  and  more 
complicated  chains  of  relations  will  be  left  aside  for  the  moment. 
Also,  cases  requiring  a  greater  or  lesser  degree  of  monopoly 
will  be  ignored.  Moreover,  it  will  further  simplify  matters  to 
assume  that  the  importance  of  the  commodities  considered  in  the 
budgets  of  individuals  is  so  small  that  changes  in  the  marginal 

2)  E.  R.  A.  Seligman,  Shifting  and  Incidence  of  Taxation  (4th  ed.;  New  York, 
1921),  p.  214n. 

3)  Harold  Hotelling,  "Edgeworth's  Taxation  Paradox  and  the  Nature  of 
Demand  and  Supply  Functions",  Journal  of  Political  Economy,  XL  (1932), 
577—616. 
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utility  of  money,  i.e.,  the  "income  effect"  can  be  neglected  and 
that  individuals  are  able  to  devote  more  or  less  of  their  resources 
to  the  particular  commodities  considered  without  large  changes 
in  standard  of  living. 

It  can  be  shown  that  the  existence  of  the  Edgeworth  paradox, 
for  sufficiently  small  rates  of  tax,  depends  only  on  the  second 
derivatives  of  the  utility  and  cost  functions,  which  correspond  to 
the  first  derivatives  of  the  demand  and  supply  functions. 
Accordingly  we  may  approximate  the  utility  and  cost  functions 
in  the  neighborhood  of  the  equilibrium  point  (with  no  taxes) 
by  functions  of  the  second  degree  only.4) 

Accordingly  we  may  write: 
(1)  W  =  a{x  -  x)*  +  2b(x  -x){y-y)  +  c(y  -yf 

+  px(x-x)+py(y-y)+W 
for  the  total  cost  function,  W  being  the  total  cost  of  producing 
amounts  x  and  y  of  the  two  commodities,  the  remaining  symbols 
being  constants.  If  x  and  y  are  the  quantities  produced  under 
equilibrium  conditions  with  no  tax,  then  W  is  the  cost  of  producing 
this  equilibrium  output.  Under  competitive  conditions  the  price 
realized  by  producers  will  be  equal  to  marginal  cost,  so  that  we 
can  put 

dW       _ 

px  =  Mx  =  ~dx  =  px  +  2a^x  "  ^  +  2b{y  ~  S) 

(2) 


py  =  My  =  —  =  py  +  2b(x  -  x)  +  2c(y  -  y) 


dw 

dy 

Thus  the  constants  px  and  py  are  the  equilibrium  prices  obtaining 
in  the  absence  of  tax,  justifying  the  symbol  used.  It  will  be 
shown  in  due  course  that  all  of  the  constants  must  be  positive. 
Indeed  for  an  equilibrium  in  which  sellers  assume  prices  to  be 
constant,  the  marginal  cost  of  x  must  increase  with  increasing 
x,  and  similarly  for  y,  so  that  the  constants  a  and  c  are  im- 
mediately seen  to  be  positive.  A  third  requisite  for  the  equilib- 
rium to  be  stable  is  that  ac  exceed  b2.  This  condition  corresponds 
to  the  increasing  marginal  rate  of  substitution,  or  that  the  curves 
of  constant  cost  in  the  x,  y,  plane  be  concave  towards  the  origin. 

4)  For  the  completely  general  treatment  the  reader  is  referred  to  the  Hotelling 
article  cited  above. 
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These  conditions  are  necessary  in  order  that  the  producer  be  at 
a  point  of  maximum  profit  rather  than  of  minimum  profit  or 
possibly  at  a  minimax  or  saddle  point  (which  would  admit  of 
increasing  profits  by  changing  output,  assuming  prices  to  remain 
constant.) 

Similarly  we  may  write 

(3)       U  =  -A{x-xY-2B{x-x){y-y)-C{y-yY 
+  Px(x-^x)  +  Py(y-y)  +  U 

for  an  aggregate  utility  function,  the  utility  function  of  each 
individual  being  added  in  with  such  weight  as  to  make  the 
marginal  utility  of  money  unity  for  all  persons.  With  total 
utility  thus  measured  in  terms  that  make  the  marginal  utility 
of  money  one,  we  can  equate  prices  to  consumers  with  marginal 
utilities  as  follows: 

2A(x-x)-2B{y-y) 

■J  3: 

2B(x-x)  -2C(y-y) 

Again  we  see  that  Px  and  Py  turn  out  to  be  the  prices  obtaining 
at  the  equilibrium  consumption  x  and  y.  The  constants  A  and  C 
must  be  positive,  if  the  marginal  utility  of  x  is  to  decrease  with 
increasing  x,  and  similarly  for  y.  Further,  for  consumers  to  be 
in  stable  equilibrium  when  confronted  with  the  opportunity 
to  buy  at  prices  they  consider  constant,  we  must  have  AC  greater 
than  B2.  This  corresponds  to  the  diminishing  marginal  rate  of 
substitution  for  consumers,  and  insures  that  the  indifference 
curves  in  the  xy  plane  will  be  concave  away  from  the  origin. 
These  conditions  are  necessary  in  order  that  consumers  be  at  a 
point  of  maximum  utility  rather  than  at  a  minimum  or  a 
minimax,  assuming  that  prices  are  to  be  constant.  If  these 
conditions  are  not  satisfied  it  would  be  possible  for  consumers 
to  improve  their  position  by  moving  away  from  the  equilibrium 
point  in  a  proper  direction. 

Now  the  price  to  the  consumer  will  exceed  the  price  realized 
by  the  seller  by  the  amount  of  the  tax,  so  that  we  have 

(5)     px  +  tx  =  Px>  pv  +  ty  =  Py\  px  =  Px,  py  =  Py. 


p* 

du 

dx 

-u. 

=  P* 

Py 

dU 
dy 

-uv 

=  P, 
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Solving  these  equations  for  the  tax  rates,  and  making  use  of  (2) 
and  (4),  we  have 


(6) 


tx  =  Px-i>x=  -2(A  +  a)(x  -  x)-2(B  +  b)(y  -  y) 


tv  =  Py-p,=  -2(S  +  b)(x-  x)-2(C  +  c)(y  -  y). 

These  equations  can  be  solved  for  x  and  y  in  terms  of  the  tax 
rates,  giving  us 


x  =  x 
(V) 


y  =  y 

(8) 

(9)        where  D  = 


2 

tx       B  +  b 

ty                C    +    C 

4 

A  +  a   B  +  b 
B  +  b    C  +  c 

=  *  ~~  2D  [tx{C  +  °' 


tv(B  +  b)] 


2 

A  +  a      tx 
B  +  b      iv 

4 

A  +  a  b  +  b 
B  +  b   C  +  c 

=  9  ~  2D  [lM  +a)~  tx(B  +  b)] 


A  +  a  B  +  b 
B  +  b  C  +  c 


=  (A+a){C  +  c)-{B+b)* 


these  expressions  can  in  turn  be  substituted  for  x  and  y  in  equa- 
tions (4),  yielding  the  following  expressions  for  the  prices  to 
consumers  in  terms  of  the  taxes: 


p   =  p   j 


tx  B  +  b 

B 

i 

A  +  a  tx\ 

ty       C    +     C 

1  D 

B  +  b  ty\ 

(10) 


=  Px+^[A(C  +  c)-B{B  +  b)]+^(aB-Ab) 


Py  + 


t,  B  +  b 

ty     C  +  C 
t. 


+ 


A+a  t, 
B  +  b  t. 


--  Py  +  j,  \C{A  +  a)-B(B  +  6)]  +  ^  (Be  -  bC). 


Consider  the  effect  of  imposing  a  tax  on  x.  If  this  is  to  diminish 
the  prices  of  both  x  and  y,  we  must  have,  from  (10) 


(n; 


dp 


L[A{C+c)-B(B+b)]<0;  d-l?  =  ±(Bc-bC)<0 


D,  however,  can  be  shown  to  be  positive.  For  D  can  be  written: 
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(12)       D=  {AC  —  B2)+(ac  -  b2)  +  (aC  +  Ac  -  2Bb); 

the  first  two  parentheses  are  positive  by  reason  of  the  maximum 
conditions;  in  the  final  parenthesis  we  have: 


(13)  aC  +  Ac>  2\/aCAc  =  2VACVac  >  2Bb, 

so  that  all  three  parentheses  are  positive  and  thus  D  is  positive. 
Hence  the  conditions  for  a  tax  on  x  to  increase  the  prices  of 
both  x  and  y  are: 

(14)  A(C  +  c)-  B{B  +  b)  =  AC  -  B2  +  Ac-  Bb  <  0 

(15)  Be  -bC  <  0. 

To  show  that  b  and  B  are  positive,  we  multiply  (14)  by  C 
(which  is  positive),  and  add  and  subtract  B2c,  getting: 

(16)  c{AC-B2)+ACc-BCb-B2c+B2c  <  0, 

or  (C+c){AC-B2)  +  B{Bc-bC)  <  0. 

In  this  last  expression  the  first  two  parentheses  are  both  positive, 
and  the  last  parenthesis  is  negative  from  (15);  thus  if  the  whole 
expression  is  to  be  negative,  B  must  be  positive.  If  B  is  positive, 
then  from  (15)  b  must  also  be  positive.  This  means  that  the  two 
commodities  x  and  y  compete  both  in  production  and  consump- 
tion. That  is,  the  more  of  one  that  is  consumed,  the  smaller  will 
be  the  marginal  utility  of  the  other,  and  the  more  of  one  that  is 
produced,  the  greater  will  be  the  marginal  cost  of  producing 
the  other.  This  is  the  condition  verbalized  by  Hotelling.  Com- 
bining the  conditions  (14)  and  (15)  with  the  conditions  that  a, 
b,  c,  A,  By  and  C  all  be  positive,  and  the  convexity  conditions 

(17)  ac  -  b2  >  0;  AC  -  B2  >  0 

we  can  obtain  the  following  chain  of  relationships: 
a       ac—b2       b 

,     x         b  be  c 

(18 

v     ;  AC-B2 


b       A      a 
c       B 

IC-B 
Be 

2      A 

B      B 

c>c- 

BC 

In  this  chain,  the  first  relation  is  an  identity,  the  second  follows 
from  (17)  (plus  the  fact  that  all  the  coefficients  are  positive); 
the  third  follows  from   (14),  the  fourth  again  from   (17),  the 
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fifth  is  an  identity,  and  the  sixth  follows  from  (17).  (15)  is 
needed  only  to  show  that  the  coefficients  are  actually  positive, 
and  in  fact  if  all  coefficients  are  positive,  (15)  follows  immediately 
from  the  last  four  relations  in  the  chain.  That  is,  if  a  tax  on  x 
lowers  the  price  of  x,  and  the  commodities  compete  in  production 
and  in  consumption,  the  tax  on  x  will  also  lower  the  price  of  y. 
It  is  not  enough,  however,  to  produce  the  paradox,  to  say  that 
the  two  commodities  must  compete  in  consumption  and  produc- 
tion. To  examine  these  conditions  further,  we  can  rewrite  the 
utility  and  cost  functions  in  such  a  way  as  to  split  up  the  two 
commodities  into  two  independent  factors.  That  is,  we  can  think 
of  the  production  of  the  commodities  as  being  carried  on  in, 
say,  two  independent  stages,  one  stage  being  common  to  both 
products,  and  the  other  production  process  being  applied  to 
only  one.  For  example,  one  can  consider  as  two  separate  products 
ground  coffee  (x)  and  coffee  in  the  bean  (y),  the  growing,  curing, 
and  roasting  of  the  coffee  being  one  process  applied  to  both 
products  and  the  grinding  a  supplementary  process  applied  only 
to  the  x.  If  these  processes  are  independent  in  the  sense  that 
the  volume  of  one  process  has  no  effect  on  the  costs  involved  in 
the  other,  then  we  should  be  able  to  write  W=f(x)-\-g(y-\-<x.x), 
where  a  is  the  number  of  units  of  x  that  correspond  to  a  unit 
of  y  in  the  common  process;  in  the  case  of  ground  and  bean 
coffee,  this  is  obviously  pound  for  pound,  or  unity.  In  general, 
we  find  that  in  order  for  the  processes  to  be  independent,  we 
must  put 

b 
(19) 

c 

and  we  can  write  the  cost  function  as: 


ac—b2  I  _         b  _       _  ac—b2 

W= x 


(20) 


/  .        b  _         ac-b2   \         (bV 
{px--py-2-^-x)x+c[y+-x) 

(pv~2bx-2cy)  [y+  -*)  +  Wo, 


where  W0  represents  the  fixed  costs  at  zero  output.  In  general 
the  smaller  the  term  (ac  —  b2)jc,  the  less  variable  will  be  the 
marginal  cost  of  the  separate  process.  This  corresponds  to  the 
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fact  that  the  more  constant  the  marginal  cost  of  the  supplemen- 
tary process  the  more  closely  competitive  the  products  will  be  as 
to  supply. 

Similarly,  we  can  decompose  the  utility  function  by  considering 
that  there  is  a  common  service  that  can  be  rendered  indifferently 
by  either  commodity,  but  that  there  is  an  additional  service 
provided  by  one  (x)  but  not  by  the  other  (y).  Thus  we  may 
assume  for  present  purposes  that  either  coffee  ground  at  the 
store  or  coffee  ground  on  the  premises  will  serve  to  produce  a 
satisfactory  beverage,  but  that  having  the  coffee  ground  at  the 
store  provides  an  additional  satisfaction  in  terms  of  convenience. 
If  these  two  satisfactions  are  to  be  independent  of  each  other, 
then  BjC  must  be  the  amount  of  the  x  necessary  to  substitute 
for  one  unit  of  the  y  in  terms  of  the  common  satisfaction  (note: 
this  is  not  the  same  as  the  marginal  rate  of  substitution,  as  it 
refers  only  to  a  part  of  the  total  satisfaction  derived  from  the  x). 
Thus  we  rewrite  (3)  as  follows: 

(AC-B*\    m     I  5  AC-B*    \  /        B    Y 

(21)  /        B    \ 

+  (Pv  +  2Bw  +  2Cy)\y+-x)+U0, 

where  U0  represents  the  level  of  utility  when  no  x  or  y  is  consumed. 
Now  we  see  from  (18)  that  this  ratio  BjC  must  be  less  than  the 
ratio  b/c  obtaining  in  the  common  manufacturing  process.  That 
is,  if  in  the  growing  and  roasting  of  coffee  the  equivalence  is 
pound  for  pound,  then  in  the  brewing  of  coffee  it  is  necessary, 
for  the  Edgeworth  phenomenon  to  occur,  that  more  of  the  pre- 
ground  coffee  be  required  to  produce  a  given  number  of  cups  of 
beverage  of  a  given  quality  than  of  the  bean  coffee  ground  on 
the  spot,  in  this  case  a  not  unreasonable  hypothesis  owing  to 
loss  of  strength,  and  possibly  wastage  where  the  remainder  of 
a  supply  becomes  unfit  for  use. 

Here  then  emerges  a  more  or  less  plausible  pattern  where  the 
Edgeworth  phenomenon  would  be  likely  to  occur,  under  com- 
petitive conditions.  One  can  envisage  a  situation  in  which  con- 
sumers are  on  the  whole  willing  to  pay  a  certain  premium  for 
the  convenience  of  not  having  to  grind  coffee  at  home,  but  will 
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to  a  large  extent  forego  this  convenience  if  its  cost  is  much 
increased;  under  these  circumstances  a  tax  on  ground  coffee, 
raising  the  cost  of  this  convenience,  might  lead  to  a  substantial 
shift  from  ground  coffee  to  bean  coffee.  But  since  there  is  less 
wastage  with  the  bean  coffee,  the  total  demand  for  the  growing 
and  roasting  of  coffee  may  decline,  and  is  particularly  likely 
to  do  so  if  the  demand  for  coffee  as  a  beverage  is  inelastic.  If 
the  supply  of  coffee  is  inelastic  (as  was  in  fact  the  case  prior 
to  the  coffee  valorization  schemes  of  Brazil)  this  diversion  to  a 
more  efficient  (but  less  satisfying)  method  of  utilization  may  well 
reduce  the  price  of  the  green  coffee  and  with  it  the  price  of  bean 
coffee  to  the  point  where  it  would  more  than  offset  the  tax  on 
that  portion  that  is  still  pre-ground.  More  cups  of  coffee  would 
be  consumed,  but  fewer  pounds. 

The  essential  requirement,  then,  when  the  matter  is  analyzed 
in  this  way,  is  that  more  of  the  taxed  article  be  required  to 
replace  one  unit  of  the  untaxed  article  in  providing  for  the 
common  element  of  satisfaction  to  the  consumer  than  can  be 
procured  in  exchange  for  one  unit  of  the  untaxed  article  in  the 
common  production  process.  Moreover,  the  difference  between 
these  substitution  ratios  must  be  greater,  the  less  perfect  is  the 
substitution  in  consumption  between  the  two  commodities,  and 
the  more  elastic  is  the  output  of  the  common  production  process. 
For  we  can  rewrite  (14)  as  follows: 

Bb  >  AC  -  B*  +  Ac 

We  may  take  (AC  —  B2)/B2  as  an  index  of  the  degree  of  im- 
perfect substitution,  being  zero  when  the  two  commodities  are 
perfect  substitutes  and  infinite  when  they  are  completely 
independent;  the  factor  (1  +  Cjc)  reflects  the  elasticity  of  demand 
for  the  untaxed  commodity,  relative  to  that  of  its  supply. 

It  is  not  necessary,  however,  to  treat  the  taxed  commodity 
as  the  one  which  undergoes  a  second  process  or  the  one  which 
provides  a  second  type  of  satisfaction.  The  untaxed  commodity 
may  equally  well  be  regarded  in  this  light.  In  this  case  the  amount 
of  the  taxed  commodity  equivalent  to  a  unit  of  the  untaxed  is 
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ajb  in  the  common  production  process,  and  A/B  in  the  common 
form  of  satisfaction;  as  before,  it  must  require  relatively  more 
of  the  taxed  article  to  take  the  place  of  a  unit  of  the  untaxed 
in  giving  the  common  satisfaction  than  is  produced  in  the  com- 
mon process  with  the  resources  required  to  produce  one  unit 
of  the  untaxed  item.  An  example  of  this  type  might  be  plain  and 
creosoted  railroad  ties:  it  may  require  two  or  three  plain  ties 
to  give  as  many  years  of  service  as  one  creosoted  tie,  and  in 
addition  the  labor  of  replacement  is  saved,  whereas  the  cost  of 
cutting  ties  may  be  much  the  same  whether  or  not  the  ties  are 
later  to  be  treated  with  preservative  in  a  separate  process.  A 
tax  on  the  commodity  that  loses  in  its  relative  importance  as 
we  pass  from  the  common  process  (cutting)  to  the  common 
service  (rail-bearing)  (in  this  case  the  plain  tie),  might  well 
induce  the  substitution  of  more  treated  ties,  decrease  the  number 
of  ties  needed  for  a  given  amount  of  service,  and  if  the  supply 
of  tie  lumber  is  inelastic,  might  conceivably  drive  the  price  of 
tie  lumber  down  by  more  than  the  tax.  This  requires  of  course 
not  only  that  the  supply  of  tie  lumber  be  inelastic  so  that  a  drop 
in  demand  will  cause  a  substantial  drop  in  price,  but  also  that  the 
supply  of  the  treatment  process  be  reasonably  elastic  so  that  the 
cost  of  this  processing  will  not  go  up  too  much  when  the  demand 
is  increased  as  a  result  of  the  tax. 

In  cases  of  this  type  we  may  derive  from  (18)  the  following 
expression  for  the  required  difference  between  the  ratios: 

a       A       ac  -b2      AC  -  B2 
(23)  J-B>^c bT~ 

Here  we  see  that  the  required  difference  becomes  larger  the  less 
perfect  the  competition  between  the  commodities  either  in  con- 
sumption or  in  production. 

It  would  also  be  possible  to  have  a  situation  where  one  of  the 
two  commodities  underwent  the  extra  processing  while  the  other 
conferred  the  extra  benefit.  Realistic  examples  of  this  are  a  bit 
hard  to  find,  but  possibly  a  number  of  food  preserving  practices 
might  come  under  this  heading.  The  preserving  process  may  be 
considered  to  reduce  spoilage  and  thus  reduce  the  amount  of 
raw  produce  required  to  provide  a  given  number  of  calories  to 
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the  ultimate  consumer,  as  compared  with  unpreserved  produce. 
Some  of  the  processes  appear  to  be  in  varying  degrees  deleterious, 
so  that  fresh  produce  has  a  separate  utility  attached  to  it  in 
the  shape  of  the  absence  of  these  deleterious  effects.  If  consumers 
were  adequately  aware  of  these  factors  and  willing  to  pay  a 
premium  price  for  produce  not  so  processed,  an  Edgeworth 
phenomenon  might  be  obtainable  by  putting  a  tax  on  the  non- 
preserved  varieties,  inducing  a  shift  to  the  preserved  varieties, 
reducing  the  demand  for  raw  produce,  and  driving  the  price 
of  raw  produce  down,  possibly  by  more  than  the  tax. 

There  is  also  the  symmetrical  possibility  of  a  tax  raising  the 
price  to  the  buyer  by  more  than  the  tax,  and  thus  raising  the 
price  received  by  sellers  for  both  the  taxed  and  untaxed  items. 
Instead  of  (18),  for  this  case  we  have,  for  a  tax  on  y,  the  con- 
ditions 


(24) 


a       ac—b2       b        b       A       ac—b2       A 
~b=     be     +  7  >  7  >  £  +     Be     >~B 

_AC-B2      B       B 

~      BC       [~C>~C' 


The  only  difference  is  in  the  fourth  expression,  and  in  the  fact 
that  the  tax  is  now  on  y  rather  than  x. 

A  possible  instance  of  this  sort  is  the  margarine  tax:  the 
acreage  required  to  produce  a  pound  of  butter  appears  to  be 
substantially  greater  than  that  required  to  produce  a  pound  of 
margarine,  while  the  substitution  nutritionally  is  pound  for 
pound;  if  the  demand  for  table  fats  is  very  inelastic,  the  tax 
on  margarine  might  mean  that  more  acreage  would  be  required 
to  supply  the  demand  for  table  fats,  and  land  prices,  rural  rents, 
fodder  prices,  and  the  price  of  margarine  ingredients  might  all 
be  increased  as  a  result  of  the  tax.  This  is  to  some  extent  the  same 
configuration  that  would  lead  to  the  possibility  of  a  tax  on  butter 
reducing  the  price  of  both  butter  and  margarine  to  the  consumer. 
The  one,  however,  does  not  necessarily  involve  the  other,  though 
if  the  commodities  are  extremely  close  competitors,  it  is  likely 
that  the  two  phenomena  will  occur  together  or  not  at  all.  In 
this  particular  instance,  however,  it  appears  likely  that  the 
demand  for  table  fats  is  sufficiently  elastic  and  the  competition 


176  ESSAYS  IN  ECONOMICS  AND  ECONOMETRICS 

between  margarine  and  butter  for  the  use  of  land  is  sufficiently 
indirect  that  the  Edgeworth  phenomenon  is  actually  not  likely 
to  emerge. 

This  by  no  means  exhausts  the  possibilities,  but  it  does  provide 
a  few  hints  as  to  where  to  look  for  the  phenomenon  and  what  its 
nature  is.  In  particular,  although  mathematically  it  is  possible  to 
substitute  for  interdependent  commodities  properly  selected 
"baskets"  that  are  mutually  independent  (at  least  in  a  small 
neighborhood),  this  mathematical  process  need  not  correspond 
to  any  identifiable  "satisfactions"  or  "processes"  as  we  have 
here  assumed,  and  it  would  be  quite  possible  for  the  Edgeworth 
phenomenon  to  occur  without  such  mathematical  breakdowns 
having  any  obvious  real  counterpart.  Moreover,  we  have  examined 
only  the  case  of  two  interrelated  commodities,  and  it  is  clear 
that  if  we  allow  for  more  complex  interrelations  among  several 
commodities  additional  possibilities  may  occur.  But  these  simple 
problems  may  serve  to  provide  some  slight  further  insight  into 
the  problems  of  tax  incidence. 

None  of  the  foregoing  serves  to  invalidate  in  any  way  the 
proposition  that  from  an  inclusive  viewpoint  excise  taxes  that 
produce  divergences  between  marginal  cost  and  price  to  the 
consumer  impair  the  general  welfare.  All  that  has  been  shown  is 
that  under  some  circumstances  the  imposition  of  a  tax  may 
redound  to  the  net  benefit  of  some  of  the  parties  to  the  taxed 
transaction,  in  this  case,  the  consumers.  The  other  parties  (here, 
the  sellers)  will  in  general  lose  by  more  than  the  sum  of  the 
benefit  to  the  gainers  and  the  revenue  to  the  government.  Thus 
unless  the  sellers  are  deemed  in  some  way  less  worthy  of  con- 
sideration than  the  buyers,  such  a  tax  is  still  not  a  desirable 
one  from  the  point  of  view  of  the  community  as  a  whole,  nor  can 
the  discovery  of  an  instance  of  Edgeworth's  phenomenon  be 
considered  necessarily  "welcome  news  to  the  harassed  minister 
of  finance."  For  we  can  write  for  the  excess  of  utilities  over 
costs: 

(25)  S=U-W 

=S-  (A  +a)  (x-x)*-  (B+b)  (x-x)  (y-y)-(C+c)  (y-y)* 

This  function  is  thus  at  a  maximum  when  x  =  x  and  y  =  y. 
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That  this  is  a  true  maximum  and  not  a  minimax  is  assured  by 
the  fact  that  D  is  positive. 

Even  if  we  were  to  admit  that  the  buyers  for  some  reason 
should  be  benefitted  at  the  expense  of  the  sellers,  it  would  in 
general  be  desirable  to  levy  a  direct  tax  on  the  rents,  income, 
or  other  surplus  of  the  sellers  and  pay  a  direct  subsidy  to  the 
buyers,  rather  than  achieve  the  redistribution  by  means  of  an 
excise  tax.  To  be  sure,  if  some  sort  of  excise  is  to  be  used  as  the 
means  of  achieving  this  redistribution,  then  a  tax  in  an  Edge- 
worthian  situation  would  be  relatively  more  effective  in  achieving 
this  redistribution  than  a  tax  on  an  independent  commodity 
where  the  amount  of  redistribution  would  necessarily  be  less 
than  the  tax  revenue.  But  in  general  an  excise  tax  cannot  be 
justified  by  such  an  argument  except  where  the  person  who  is 
to  bear  the  burden  would  otherwise  be  beyond  the  reach  of  the 
taxing  authority.  The  usual  case  of  this  sort  is  when  the  seller 
is  a  foreigner.  In  the  coffee  case  cited  above,  the  redistribution 
carried  out  is  one  in  favor  of  the  domestic  coffee  drinker  at  the 
expense  of  the  Brazilian  coffee  grower.  But  this  is  in  reality 
no  more  than  a  peculiarly  effective  instance  of  monopsonistic 
exploitation  by  the  taxing  country,  and  unless  such  action  is 
taken  as  a  means  of  adjusting  disequilibria  in  international  trade, 
it  could  well  be  considered  a  form  of  economic  warfare. 

In  short  it  is  wrong  to  dismiss  the  Edgeworth  paradox  as 
something  that  can  happen  only  in  rare  and  peculiar  cir- 
cumstances. The  phenomenon  is  not  frequent,  to  be  sure,  but 
there  seems  little  doubt  but  that  cases  can  occur  and  that  the 
circumstances  tending  to  produce  the  phenomenon  can  be  rec- 
ognized if  one  knows  what  to  look  for.  On  the  other  hand, 
even  when  found,  instances  of  the  phenomenon  are  far  from  the 
"grateful  boon"  that  Seligman  thought  that  they  would  provide, 
did  they  exist.  At  least  in  domestic  matters,  existence  of  an 
Edgeworth  phenomenon  in  no  way  justifies  a  departure  from 
normal  standards  of  tax  policy.  It  is  only  where  the  analysis 
relates  to  the  foreign  policy  of  a  nationalistic  state,  bent  on 
deriving  the  maximum  advantage  from  its  strategic  position, 
that  the  discovery  and  exploitation  of  the  phenomenon  becomes 
of  concern. 


CHARLES  E.  FERGUSON 


Modified  Edgeworth  Phenomena  and 
the  Nature  of  Related  Commodities 


I.  Introduction 

In  one  of  his  most  celebrated  contributions  to  economic 
theory,1)  Professor  Hotelling  analyzed  the  so-called  Edgeworth 
Taxation  Paradox,  viz.,  the  proposition  that  an  excise  tax  imposed 
upon  one  of  two  related  commodities  may  result  in  a  decline  in 
the  price  of  both.  Far  from  being  a  mere  curiosum  as  Edgeworth 
suggested,  he  found  that  the  conditions  of  the  Paradox  are  satis- 
fied in  a  wide  class  of  cases  conforming  to  certain  theoretical 
requirements  which  he  adduced  for  demand  and  supply  functions. 
More  specifically,  Professor  Hotelling  proved  that  in  both 
competitive2)  and  monopolistic  markets,  the  Edgeworth  phe- 
nomenon is  possible  if  the  two  commodities  are  substitutes  in 
both  consumption  and  production. 

At  this  quarter-century  anniversary  of  Professor  Hotelling's 
important  paper,  it  seems  appropriate  to  extend  his  analysis  to 
modified  Edgeworth  cases.  In  particular,  we  investigate  the 
consumption  and  production  relationships  that  must  prevail  if 
an  excise  tax  imposed  upon  one  commodity  results  (a)  in  a 
decrease  in  its  price  and  an  increase  in  the  price  of  the  other 
(related)  commodity,  and  (b)  in  an  increase  in  the  price  of  the 
taxed  commodity  but  a  decrease  in  the  price  of  the  related 
product.  As  a  convention,  and  without  loss  of  generality, 
throughout  the  paper  we  assume  that  the  tax  in  question  is 
imposed  on  good  Xv 

x)  Harold  Hotelling,  "Edgeworth's  Taxation  Paradox  and  the  Nature  of 
Demand  and  Supply  Functions,"  Journal  of  Political  Economy,  XL  (1932), 
pp.  577—616. 

2)  "Competition"  is  used  here  in  the  Cournot  sense  of  competition  between 
two  monopolists. 
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II.  Analytical  Framework 

Consider  two  related  commodities,  Xx  and  X2,  whose  demand 
and  supply  functions  are  given  by 

(1)  pt  --=  f{(xv  x2)  and  pt  =  gt(xlh  x2)        (i  =  1,  2) 

respectively;  or,  assuming  that  the  Jacobian  of  transformation 
does  not  vanish,  by  the  inverse  relations 

*i  =  Fi(Pv  Pi)  and  xt  =  Gt(pv  p2)  (i  =  1,  2). 
We  assume  that  these  functions  satisfy  the  theoretical  require- 
ments prescribed  by  Professor  Hotelling  for  the  case  of  unlimited 
budgets.  Denoting  partial  derivation  by  an  additional  subscript, 
these  requirements  are,  in  terms  of  equations  (l):3) 

(2)  fa  =  fn>  gi,  =  gn>  Uii\  <  0,   [gtil  >  0. 
Furthermore,   we  adopt  the  definition  of  complementarity 

suggested  by  Professor  HoteUing  and  others.4)  Specifically,  Xx 
and  X2  are  competitive  or  complementary  according  as 
F12  =  F21  <  0.  However,  since  we  will  use  equations  in  the 
form  displayed  in  (1),  it  is  necessary  to  express  this  definition 
of  complementarity  in  terms  of  the  /-  and  g-f unctions.  Let 

d(xlt  x2) 

d(fiv  Pi) 

Then  obtaining  the  inverse  of  the  matrix  of  partial  derivatives 
by  the  adjoint  method,  we  can  display  the  relationship  between 
the  partial  derivatives  of  the  quantities  with  respect  to  the  prices 
and  the  partials  of  the  prices  with  respect  to  the  quantities: 

-^11  =  //22»    ^12  =  ^21  =  — //21  =   —  Jfl2>    F22  =  Jfiv 

Since  we  have  assumed  that  the  quadratic  form  is  negative 
definite,5)  /  >  0.  Hence  the  sign  of  f12  =  f21,  is  the  opposite 
of  the  sign  of  F12  =  F2V 

3)  Hotelling,  op.  cit.,  p.  591  and  p.  597.  More  specifically,  the  two  important 
properties  are:  (a)  the  "symmetry"  or  "integrability"  conditions;  and  (b) 
negative  (positive)  definiteness  of  the  quadratic  form  of  the  coefficients  of  the 
demand   (supply)  functions. 

4)  Ibid.,  p.  599.  See  also  Oscar  Lange,  "Complementarity  and  Interrelations 
of  Shifts  in  Demand,"  Review  of  Economic  Studies,  VIII   (1940 — 41),  59. 

5)  This  assumption  is  expressed  in  relations  (2)  for  the  /-  and  g-i unctions . 
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Accordingly,  X1  and  X2  are  competitive  or  complementary  in 
consumption  according  as  /12  §  0.  Similarly,  by  an  extension 
of  the  Hotelling  definition,  the  goods  are  competitive  or  com- 
plementary in  production  according  as  G12  =  G21  §  0.  Con- 
sequently, Xx  and  X2  are  also  competitive  or  complementary 
in  production  according  as  g12  =  g21  J  0. 

Before  proceeding  to  the  analysis  of  cases,  it  is  convenient  to 
establish  the  equations  expressing  the  change  in  commodity 
price  that  is  attributable  to  an  excise  tax.  This  is  done  for  both 
competitive  and  monopolistic  markets. 

Let  h{  =  fi  —  gif  so  that 

Ki  =  ha  =  dhJdXj  =  dhJdXi, 
and  designate  the  accompanying  Jacobian  by 

d{h1}  k 


D 


>0, 


d(xv  x2) 

since  the  elements  of  D  are  the  differences  between  the  cor- 
responding elements  of  a  negative  definite  and  a  positive  definite 
quadratic  form.  If  pt  and  xt  are  the  equilibrium  values  of  price 
and  quantity  that  bring  competitive  demand  and  supply  into 
equality,  then 

(3)  M*i>  *8)  =  0  (i  =  1,  2). 
Assume  that  a  small  excise  tax  of  tx  per  unit  is  imposed  upon 
X±  and  designate  the  resulting  prices  and  quantities  by  j>i-\-A/pi, 
Xt  —  Axt.  In  this  situation,  the  demand  price  of  the  taxed 
commodity  must  exceed  its  supply  price  by  the  amount  of 
the  tax.  That  is, 

(4)  h^+AXi,  x2+Ax2)=tv  h2(x1-\-Ax1>  x2+Ax2)=Q. 

Subtracting  equation  (3)  from  equation  (4)  and  applying  the 
definition  of  "derivative",  we  obtain 


(5) 


2  V* 


&uh> 


(i  =  1,  2) 


3=1 


where  du  is  the  Kronecker  delta.  Equations  (5)  are  simultaneous 
linear  equations  which  are  solvable  by  Cramer's  Rule  for  Axt: 


(6) 


Ax,  =  - 


k   K2 
0    ho*, 


Ax* 


hi  o 
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From  equation  (1), 

3=1 

Substituting  the  expressions  for  Axi  from  equation  (6), 

h 

Ap1  =  -p(fnhZ2-  f12h21), 

k 
Ap2  =  -p  (/2A2  —  fwK-d- 

Finally,  substituting  the  expression  ftj  —  g„  for  h^  and  recalling 
that  t±  >  0  and  D  >  0,  we  can  write  the  following  expression 
relating  the  direction  of  change  in  price  to  the  imposition  of  an 
excise  tax  on  the  first  commodity: 

(7a)         sign  (Ap%)  =  sign  (/n/22  -  /ugaa  -  f\2  +  /12g21), 

(7b)         sign  (Ap2)  =  sign  (/22g21  -  f21g22). 

Equations  (7)  express  the  price  changes  attributable  to  an 
excise  on  Xx  when  the  two  related  commodities  are  produced 
by  competing  entrepreneurs.  We  turn  next  to  the  situation  that 
prevails  when  both  commodities  are  produced  by  a  single 
monopolist.  As  Professor  Hotelling  has  pointed  out,  in  the 
monopoly  situation  the  supply  relationship  between  commodities 
is  immaterial.  Accordingly,  we  investigate  the  consumption 
relationships  under  monopoly  using  the  convenient  fiction  of 
costless  production. 

After  the  imposition  of  an  excise  tax  on  the  first  commodity, 
the  monopolist  maximizes  the  resulting  profit  equation 

Pixi  +  Pzx2  —  hxi  =  n  —  hxi> 

thus  determining  quantities  so  that 

nx  =  tv  7t2  =  0,  and  \nti"\  <C  0. 

By  differentiating  the  two  first-order  conditions  with  respect 
to  the  tax  rate,  we  obtain 

2 

2^axit=  &ii>  (i=I,  2) 

3=1 

where  du  is  a  Kronecker  delta.  These  are  two  linear  equations 
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which  may  be  solved  for  the  unknown  tax  derivatives.  Applying 
Cramer's  Rule, 

cof actor  of  n** 
IL^/JI 
Then  using  equation  (1),  the  price  changes  corresponding  to 
a  tax  on  Xx  can  be  determined: 

2  2 

Pa  =  2  /«*/«  =  2  /«  Pu»  (*  =  !>  2) 

or  alternatively: 

(8a)  sign  (flt)  =  sign  (fnVu  +  /ia9?12), 

(8b)  sign  (p2t)  =  sign  (/2l9?11  +  /229?12). 

Equations  (1),  (2),  (7),  and  (8)  and  the  definition  of  complemen- 
tarity provide  all  that  we  need  of  a  purely  formal  nature. 

III.  Modified  Edgeworth  Phenomena  in  a  Competitive  Situation 

Professor  HoteUing  has  shown  that  a  necessary  condition  for 
the  Edgeworth  Paradox  in  a  competitive  market  is  that  the  two 
commodities  compete  in  both  consumption  and  production.6) 
We  now  investigate  the  possible  commodity  relationships  for 
modified  cases  of  the  Edgeworth  Paradox.  In  doing  so,  we 
utilize  the  convenient  feature  that  the  possible  existence,  in  any 
specific  case,  of  the  modified  Edgeworth  phenomena  can  be 
established  by  providing  an  example  of  demand  and  supply 
functions  which  meet  all  requirements  of  the  situation.  On  the 
other  hand,  to  establish  the  impossibility  of  the  phenomena 
requires  a  formal  proof. 

A.  Case  I:  Price  of  X±  Lowered,  Price  of  X2  Raised. 

The  first  modified  case  considered  is  that  in  which  an  excise 
tax  imposed  upon  Xx  results  in  a  decrease  in  its  price,  accom- 
panied by  an  increase  in  the  price  of  the  related  commodity  X2. 
From  equations  (7),  this  requires  that 


(9a)  /n/22  -  fi2  <  fng22  -  /; 

(9b)  /22g12  >  /12g22. 

6)  Hotelling,  op.  cit.,  p.  604. 


12. 


edgeworth's  taxation  paradox  and  commodity        183 

Since,  from  inequalities  (2),  /n/22  —  f\2  >  °>  /n  <  °>  and 
g22  >  0,  to  satisfy  (9a)  we  must  have  —f12g21  >  P..  This  requires 
that  either  /12  <  0  or  g21  <  0,  but  not  both. 

Consequently,  if  g21  <  0,  then  /12  >  0.  Using  this  information 
in  (9b),  together  with  the  knowledge  that  /22  <  0  and  /22gi2  >  ®> 
we  see  that  /12g22  >  0.  Therefore,  it  is  possible  for  Case  I  to 
exist  if  the  two  commodities  are  complementary  in  both  con- 
sumption and  production.  Indeed,  an  example  of  this  which 
satisfies  all  requirements  is  given  by  the  following  set  of  equa- 
tions:7) 

Demand^  r  »  "  5  + **       Supply  (£  =  )  +  *" 


\p2  =  30  +  2xx  —  5x2>  \p2  =  10  —  2#i  +  ^2- 

On  the  other  hand,  since  the  signs  of  /12  and  g21  cannot  be 
the  same,  it  is  impossible  to  obtain  the  desired  results  if  the  two 
commodities  are  competitive  in  consumption  and  complementary 
in  production,  or  vice  versa.  Furthermore,  Case  I  is  impossible 
if  the  commodities  are  substitutes  in  both  consumption  and 
production.  This  situation  requires  /12  <  0  and  g12  >  0.  That 
is,  it  is  necessary  to  satisfy  inequalities  (9)  subject  to  the  addi- 
tional inequalities: 

/i2  <  0,  Si2  >  0,  /u  <  0,  /22  <  0,  gll  >  0,  g22  >  0,  /u/22  -  /J2>0. 

The  last  inequality  above  implies 

(10)  ^>1. 

/12 

Now  since  /22  <  0  and  /12  <  0  by  the  special  hypothesis,  the 
second  inequality  in  (9b)  may  be  written  |/22gi2|  <  l/i2g22|- 
This,  in  turn,  implies 

7)  The  procedure  for  obtaining  examples  of  Edgeworth  or  modified  Edgeworth 
phenomena  is  as  follows:  determine  the  direction  of  the  inequalities  relevant 
for  equations  (7);  then  select  fi}  and  gif  values  to  satisfy  these  inequalities. 
The  fij{gij)  values  are  the  coefficients  of  xi  in  the  demand  (supply)  equations. 
Appropriate  constants  may  be  selected  by  solving  the  demand  and  supply 
equations  simultaneously.  To  check  the  equations,  introduce  an  excise  tax  on 
one  of  the  commodities.  Remembering  that  for  the  taxed  commodity,  demand 
price  must  exceed  supply  price  by  the  amount  of  the  tax,  the  demand  and 
supply  equations,  properly  adjusted,  may  be  solved  simultaneously  for  p€ 
and  xu  both  expressed  in  terms  of  the  tax  rate.  Differentiating  the  expressions 
for  pi  with  respect  to  the  tax  rate  then  confirms  the  result. 
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(H)  /22//12  <  622/512* 

In  light  of  (10),  the  inequality  in  (9a)  may  be  written  l/ng^l  < 

I/12&1L  or 

(12)  /u//12  <  g21/g22. 

Multiplying  (11)  by  (12),  we  have 

/ 11  /  22         £22^12  , 

/l2  &12  #22 

thus  contradicting  the  special  hypothesis  embodied  in  inequality 
(10). 

B.  Case  II:  Price  of  Xx  Raised,  Price  of  X2  Lowered. 

The  second  modification  that  we  investigate  is  the  case  in 
which  an  excise  tax  levied  on  the  first  commodity  results  in  an 
increase  in  its  price  but  a  decrease  in  the  price  of  the  related  good. 
From  equations  (7),  this  result  is  possible  if 

(13a)  /n/22  -  /?2  >  /ii&a  -  Zia&i. 

(13b)  /.225rl2  <  /l2S*22- 

Inequahties  (13)  are  satisfied  by  three  different  sets  of  com- 
modity relationships.  In  the  first  place,  if  the  two  goods  are 
substitutes  both  in  consumption  and  production,  Case  II  is 
satisfied  by  the  following  set  of  equations: 

_  .  f  ft-,  =  41  —  5x,  —  2#2,      0        ,     I  i>i  =  1  +  9a?!  +  4#2, 

Demand  ( J  =  21  -  2^  -  2,1      **&  ( £  =  1  +  4,;  +  2,1. 

Secondly,  Case  II  is  also  possible  when  the  commodities  are 
complementary  in  consumption  and  competitive  in  production. 
An  example  of  demand  and  supply  functions  satisfying  these 
conditions  is  given  by 

^  ,  f  S,  =  12  —  2x,  4-  x2,  0        .     f  fit  =  3  +  x,  +    #2, 

^and  ( £  _  „  +    ^  +  ^         Supply  [  J  _  %  +  ^  +  ^ 

Finally,  if  the  two  commodities  are  complements  in  both  con- 
sumption and  production,  the  modified  Edgeworth  results  can 
be  obtained.  An  example  meeting  these  conditions  is 

■  ,  f  px  =  20  —  5xt  '+  ±x2,      c        .     f  j>x  =  6  +  3^  -  3x2, 

Demand  K1       _  Q    ,     .       '  Supply  r/        .       0       ,    ^ 

\  £2  =  18  +  4xx  -  ±x2,  ™  J  \p2  =  4  -  3*!  +  4z2. 
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On  the  other  hand,  Case  II  is  not  possible  if  the  commodities 
are  substitutes  in  consumption  and  complements  in  production. 
For  if  this  were  so,  /12  <  0  and  g12  <  0,  implying  fZ2gi2  >  0 
and  /12g22  <  0-  But  these  two  inequalities  are  inconsistent  with 
(13b),  and,  accordingly,  the  special  hypothesis  must  be  rejected. 

IV.  Modified  Edgeworth  Phenomena  in  a  Monopoly  Situation 

We  now  investigate  the  feasibility  of  the  modified  Edgeworth 
phenomena  in  a  monopoly  situation,  using  the  same  two  cases 
discussed  in  Section  III  above.  As  stated  previously,  Professor 
Hotelling  has  proved  that  a  tax  on  one  commodity  can  result 
in  a  decrease  in  the  price  of  both  if  the  goods  are  substitutes 
in  consumption,  the  production  relationship  being  less  material 
under  monopolistic  organization. 

A.  Case  I. 

From  equations  (8),  an  excise  tax  imposed  upon  Xx  will  result 
in  a  decrease  in  its  price  and  an  increase  in  the  price  of  X2  if8) 

(14a)  /nPii  +  Zi2<Pi2  <  0, 

(14b)  f12<pn  +  /22<Pi2  >  0. 

Of  the  two  possible  relationships,  Case  I  may  exist  if  the  com- 
modities are  complementary  in  consumption.  This  is  illustrated 
by  the  following  quadratic  demand  functions:9) 

8)  It  should  be  recalled  that  we  have  the  additional  information  that 

fH  <  0,  <pu  <  0,   |  [/«,]  |  >  0,  and  |  [fly,]  |  >  0. 

9)  The  procedure  involved  in  obtaining  examples  for  a  monopolistic  market 
can  be  sketched  briefly.  Determine  the  relevant  inequalities  for  equations  (8) 
and  select  a  set  of  values  for  fi}-  and  cp^  which  satisfy  these  inequalities  and  the 
theoretical  requisites  of  demand  functions.  Next  obtain  the  three  different 
equations  for  71^;  these  equations  involve  xt,  fti  and  fiih.  Using  the  previously 
assumed  values  of  fti  and  postulating  equilibrium  values  for  xt,  these  equations 
may  be  solved  recursively  for  fijk.  Then  postulate  two  quadratic  demand  func- 
tions with  unknown  coefficients  and  constants.  Differentiating  these  quadratic 
functions  twice,  we  obtain  expressions  for  fiik  in  terms  of  the  coefficients  of 
the  demand  functions.  Consequently,  since  values  have  previously  been  deter- 
mined for  ftik,  one  may  obtain  numerical  solutions  for  the  coefficients.  Next 
substitute  the  numerical  values  into  the  demand  functions,  and  the  latter  into 
the  profit  equation  of  the  monopolist.  Then  state  the  first-order  conditions 
for  maximum  profit  and  substitute  the  assumed  equilibrium  values  of  xt  into 
these  equations.  The  resulting  expressions  may  be  solved  for  the  constants 
of  the  demand  equations.  The  functions  thus  obtained  may  be  checked  by 


186  ESSAYS  IN  ECONOMICS  AND  ECONOMETRICS 

Pi  ==    2X1         xlx%  —  5#i  H~  3a?2  ~f"  ~%~> 

1>2  =  -\*\  +  \A  +  3^  -  2z2  -  2. 
On  the  other  hand,  Case  I  is  impossible  if  the  commodities 
are  competitive  in  consumption.  For  if  so,  /12  <  0.  Then  with 
fu  <  0  and  <pu  <  0,  in  order  to  satisfy  (14a),  we  must  have 
<p12  >  0.  But  then  \flx<pn\  <  \f12<Pi2\>  or 

(15)  /11//12  <  -W<Pn- 

To  satisfy  (14b)  it  is  necessary  that  |/12^nl  >  I/m^uI*  or 

(16)  /22//12  <  —<Pul<Pi2> 
Multiplying  (15)  by  (16),  we  obtain 

(17)  /l^2<^2^1=L 

/l2  9?11 9?12 

But  since  fti  are  the  coefficients  of  a  negative  definite  quadratic 
form, 

(18)  fnfj&  >  L 

Since  (17)  and  (18)  are  inconsistent,  the  hypothesis  is  contra- 
dicted. 

B.  Case  II. 

In  some  ways,  Case  II  in  the  monopoly  situation  is  the  sim- 
plest of  all  cases,  for,  irrespective  of  the  commodity  relationship 
that  prevails,  this  modified  Edge  worth  phenomenon  is  possible. 
That  is,  Case  II  may  occur  if  the  commodities  are  either  com- 
plementary or  competitive  in  consumption.  Examples  of  the 
two  situations,  in  the  order  designated,  are  provided  by  the 
following  sets  of  demand  functions: 

Pi  ==  "2*^1  X-±X2  X2  X-y  -\-  '*X2  2", 

p2  =  —\x\  —  2xxx2  +  f  x\  +  4^  —  5x2  +  2; 

Pi  ~         "2^1  X±X2  2,X2  -f-  X-±  -f-  X2  -\-  2, 

p2  =      2*^1       ^1^2       2^2  ~r  xi  ~r  4. 

postulating  a  tax,  and  then  differentiating  the  first-order  conditions  with 
respect  to  the  tax  rate.  Next  assume  that  the  tax  is  an  incipient  excise  and 
substitute  the  assumed  equilibrium  values  of  x%  into  the  nit  equations,  obtaining 
two  simultaneous  equations  in  the  unknowns  xit.  These  may  be  solved  and 
substituted  into  the  expressions  for  fijf  completing  the  check. 
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V.  Summary 

We  are  now  in  a  position  to  summarize  the  results  of  this 
investigation. 

A.  Competition: 

1.  Professor  Hotelling  has  proved  that  the  two  related  com- 
modities must  be  competitive  in  both  consumption  and  produc- 
tion if  the  prices  of  both  are  to  be  lowered  by  the  imposition 
of  an  excise  tax  on  one  of  the  goods. 

2.  Price  of  taxed  commodity  lowered,  price  of  other  good 
raised: 

a.  this  situation  can  result  if  the  two  commodities  are  com- 
plementary in  both  consumption  and  production; 

b.  impossible  if  commodities  are  substitutes  in  both  con- 
sumption and  production,  or  if  they  are  either  complements  in 
production  and  substitutes  in  consumption  or  substitutes  in 
production  and  complements  in  consumption. 

3.  Price  of  taxed  good  increased,  price  of  other  decreased: 

a.  possible  if  the  two  commodities  are  substitutes  in  both 
consumption  and  production,  complements  in  consumption  and 
substitutes  in  production,  or  complements  in  both  consumption 
and  production; 

b.  impossible  if  the  two  commodities  are  substitutes  in  con- 
sumption and  complements  in  production. 

B.  Monopoly: 

1.  Professor  Hotelling  has  proved  that  the  price  of  both 
commodities  may  be  decreased  by  a  tax  on  one  of  them  provided 
the  commodities  are  substitutes  in  consumption. 

2.  Price  of  taxed  commodity  lowered,  price  of  other  raised: 

a.  possible  if  commodities  are  complementary  in  consumption; 

b.  impossible  if  commodities  are  substitutes  in  consumption. 

3.  Price  of  taxed  commodity  raised,  price  of  other  lowered: 
this  case  is  possible  irrespective  of  the  consumption  relationship 
which  prevails  between  the  commodities. 


PART  V 

On  Econometric  Methods 


RAGNAR  FRISCH 


Market  Prices  vs.  Factor  Costs  and  the 
Constancy  of  Production  Coefficients 


The  question  of  whether  it  is  most  appropriate  to  use  market 
prices  or  factor  costs  in  national  income  statistics  and  more 
generally  in  economic  model  work,  has  been  much  discussed. 
I  shall  offer  some  comments  on  this  question.  Or  rather,  I  shall 
look  at  this  whole  question  from  a  somewhat  more  general 
viewpoint  more  adaptable  for  decision  model  work.  Instead 
of  factor  input  as  distinguished  from  indirect  taxes  (minus 
subsidies),  I  shall  consider  the  proportional  input  elements  as 
distinguished  from  the  residuum. 

I  will  try  to  bring  out  that  in  so  far  as  the  global  national 
product  is  concerned,  the  effects  of  including  only  the  (propor- 
tional) factor  costs  instead  of  including  the  total  market  values, 
disappear  to  a  large  extent  when  the  values  are  deflated  so  as  to 
bring  out  the  volume  figures  instead  of  the  value  figures. 

In  so  far  as  the  production  coefficients  are  concerned,  the  dif- 
ference between  alternative  kinds  of  figures  is  more  complicated. 
We  have  to  distinguish  between  at  least  four  different  types  of 
magnitudes:  strictly  physical  quantities,  volume  figures,  semi 
volume  figures  and  current  values.  For  each  of  these  types  we 
must  look  into  the  question  of  the  constancy  of  the  coefficients. 

For  practical  reasons  the  usual  input  output  coefficients  are 
as  a  rule  computed  as  ratios  between  market  values  observed 
in  a  base  year.  This  is  also  done  in  Norwegian  work  and  in  a 
general  way  I  agree  to  this  procedure  for  the  reasons  given  in 
the  sequel.  There  are,  however,  special  purposes  for  which  some 
modifications  may  be  used.  Compare  the  comments  below  on 
the  method  followed  in  the  Oslo  median  model. 

A.  The  Strictly  Physical  Structure 

To  bring  out  the  essence  of  the  problem  as  it  appears  in  a 
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decision  model,  let  us  first  consider  a  table  with  all  final  deliveries 
aggregated,  and  with  only  strictly  physical  quantities  involved 
so  that  no  vertical  summations  are  possible.  The  result  of  such 
a  set  up  is  given  in  Table  1.  The  strictly  physical  quantities 
are  denoted  by  small  letters. 

Table  1 
Input-Output  of  Strictly  Physical  Quantities 


Receiving  sector 
No. 

Final 
delivery 

Total 
delivery 

h  =  1 

2 

Delivering 
sector  No. 

k  =  1 
2 

0 

^21 

^12 
0 

#2* 

xx 

x2 

Primary 
input 

Labour 

Non  compe- 
titive imports 

w1 

w2 

If  we  do  not  impose  any  other  relations  than  the  definitions 
of  the  total  products,  i.e., 

/-.v  ^12  "T  #i*  —  xl 

^21  ~T~  ^2*   Z=  "^2 

we  have  10  —  2  =  8  degrees  of  freedom. 

If  we  introduce  the  6  production  coefficients  by 


12 


Xy^Xn 


'21 


(2) 


w. 


C/o    On  Xq 


and  for  the  moment  consider  all  these  coefficients  as  variables, 
we  have  16  variables  and  8  equations,  hence  still  8  degrees  of 
freedom.  As  basis  variables  we  can  choose  for  instance  the  6 
coefficients  and  xlt  x2.  Or  we  can  choose  the  6  coefficients  and 

Xjx,  x 

(3) 


2H:.  The  8  basis  equations  are  in  the  first  case  (2)  together  with 

JC-t  ~  X-%  JC-t  o  Xn 


'lx  '"l 

In  the  second  case  they  are 


x. 


2x 


X 


(4) 


X 


1* 


*^12  *^2* 


+  3?, 


1       x12  x21 


1  '       ' 

1  ^12^21 
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together  with  the  6  expressions  obtained  by  inserting  (4)  into  (2). 
This  is  the  structure  of  the  system  expressed  in  strictly 
physical  terms.  The  case  of  constant  coefficients  is  covered 
simply  by  putting  the  coefficients  in  the  basis  equations  equal 
to  their  constant  values.  This  leaves  us  with  2  degrees  of  freedom, 
which  may,  for  instance,  be  unfolded  by  xv  x2  or  by  #lsi4J 


#2*' 

B.  The  Structure  Expressed  in  Volume  Figures 

Now  let  us  introduce  a  set  of  product  prices  nlf  n2  and  factor 
prices  tzw,  nb.  We  call  them  standard  prices.  Let  us  see  how  the 
various  constellations  of  the  system  which  are  physically  possible 
with  the  degrees  of  freedom  in  (1),  can  be  expressed  in  the  value 
terms  derived  from  the  standard  prices.  We  also  introduce 
residual  items  ev  s2  in  the  two  production  sectors.  The  residual 
items  may  be  the  sum  of  taxes  Th  and  net  profits  (savings)  Sh. 
For  practical  purposes  in  a  decision  model  these  residual  items 
are  very  important,  but  their  introduction  causes  considerable 
complications  in  the  definitional  set  up.  These  difficulties  we 
must  consider  in  a  systematic  way. 

The  new  figures  are  listed  in  Table  2.  We  could,  if  we  wanted 

to,  introduce  different  wage  rates  in  the  two  sectors  and  also 

different  import  prices,  but  that  is  unessential  in  the  present 

connection. 

Table  2 

Interflow  Table  of  Values  Reckoned  at  Standard  Prices  and  with  Standard  Residuum 

Elements 


Receiving  sector 
No. 

h  =  1         2 

Final 
delivery 

Total 
delivery 

Delivering 
sector  No. 

k  =  1 

2 

0        n1  x12 

n%  x21        0 

7l1  #x* 

^2  *^2* 

nxx1 

7l2  %2 

Primary  input 

Labour 
Non  compe- 
titive im- 
ports 

7ibb1      n^bi 

— ^(^x  +  tt'a) 

0 
0 

Residual  input 

el               e2 

-(£i+£2) 

0 

Grand  total 

7Tx  >^i         71%  •£% 

0 

nxxx+n2x2 
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The  prices  nx  and  tz2  are  actual  prices  per  physical  unit,  say 
per  kilogram  or  per  kWh.  The  wage  rate  tcw  is  also  reckoned 
per  physical  unit,  say  per  hour  of  work.  Similarly  for  7ib.  The 
residual  input  is  measured  in  money. 

It  should  be  understood  that  Tables  1  and  2  exist  simul- 
taneously, and  that  the  actual  physical  quantities  in  the  two 
tables  are  the  same. 

In  Table  2  we  have  imposed  the  condition  that  the  sum  in 
column  No.  h  shall  be  equal  to  nhxh.  This  is  equivalent  with 
defining  the  residual  eh  when  the  prices  are  given.  Or  we  may 
inversely  consider  the  condition  as  defining  the  prices  nh  in 
terms  of  the  residual  inputs  elf  e2.  We  will  most  of  the  time 
adopt  the  latter  viewpoint. 

The  column  sum  conditions  are  expressed  by 

7t2x21  +  7tww1  +  7tbb1  +  e±  =  ti1x1 
(5) 

^1^12  +  KwW2  +  ^6^2  +  £2  =  ^2^2- 

We  define  the  residual  rates  ex  and  e2  by 

(6)  ek  =  ekxk     (k  =1,  2). 

If  need  be,  these  residual  rates  will  be  called  direct  residual 
rates  to  distinguish  them  from  certain  aggregate  residual  rates 
to  be  considered  later. 

This  gives  4  equations  in  addition  to  the  8  we  had  before. 
The  additional  variables  are 

Number  of 
variables 

4  7tv    7Z2,    7lw,    7lh 

(7)  2  ev  s2 
2             sv  s2 


8  Total 


In  other  words  we  have  8  —  4  =  4  more  degrees  of  freedom 
than  in  the  table  of  strictly  physical  quantities.  As  additional 
basis  variables  we  choose  the  factor  prices  nw,  7th  and  the  residual 
rates  ev  s2.  Using  xlt  x2  rather  than  #1Hc  and  x2%  as  basis  variables, 
the  total  set  of  basis  variables  will  be 
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#]_,    X2 

JCW,    7lb 
(8)  £"l,    S2  ^ 

x'12>  x2V  w[,  w2,  b[,  b2. 

If  all  the  6  coefficients  listed  on  the  last  row  in  (8)  are  taken 
as  given,  there  are  6  basis  variable  left,  namely  those  on  the 
first  three  rows  of  (8).  Of  these  xlt  x2  determine  the  physical 
constellation  of  the  system  —  all  the  other  physical  features 
following  from  xv  x2  —  while  tzw,  nh,  ev  s2  determine  the  price 
features  —  the  other  prices  following  from  nw,  nh,  ev  e2. 

The  prices  nx  and  n2  as  functions  of  the  coefficients  and  the 
basis  price  elements,  are  determined  by  inserting  into  (5)  from 
(2)  and  (6)  which  gives  the  system  of  two  equations 

tix  —  7i2x21  =  n„w\  +  nhb[  +  s± 
—nxx12  +  n2  =  nww2  +  Tthb2  +  e2. 

The  solution  of  this  is 


(10)  %  = 

1  X^2  3?2i 

In  the  case  of  n  sectors   (9)  has  the  form 

n 

(11)  2**0  —  *')**  =  n*w'h  +  nbb'h  +  *h     (h  =  1,  2  .  .  .n) 

k=l 

which  is  solved  by 

n 

(12)  nk  =  2  (rcwwj  +  wd6i  +  e*)((5  -  z')£     (k  =  1,  2,  .  .  .  »). 

The  formulae  (10)  show  that  if  not  only  the  factor  prices, 
but  also  the  direct  residual  rates  are  constant,  the  product 
prices  will  also  be  constant.  In  other  words  the  whole  structure 
of  standard  prices  will  be  fixed.  We  consider  them  as  base 
prices  and  take  the  corresponding  values  as  defining  volume 
figures. 

We  put 


nv{wx 

+ 

w2 

X21 ) 

+ 

nb{bx  +  &J«ii)  +  (ei  + 

£2X21) 

nu(w[ 

x12 

+ 

K) 

+ 

i          '     ' 

l          ^2*^21 

+ 

£2) 
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Xk  =  7TkXk 


^■k*  —  nkXk* 


^■kK  —  nkXkh 


(13) 
(14) 
(15)  Wh  =  7iwwh         Bh  =  7tbbh. 

The  volume  figures  defined  by  (13) — (15)  are  entered  in 
Table  3.  That  is  to  say,  if  the  numerical  figures  are  entered, 
Table  2  and  Table  3  will  be  exactly  the  same. 

Keeping  the  price  structure  —  as  defined  through  nw>  7ib,  ev 
s2  —  constant  and  varying  xx  and  x2,  we  get  different  constella- 
tions of  the  volume  figures.  Since  it  is  simply  a  question  of  units 
of  measurement  to  pass  from  the  system  of  strictly  physical 
quantities  to  the  volume  figures,  we  can  just  as  well  think  of 
Xx  and  X2  as  varying  under  constant  nv 


7l} 


1>    °2' 


Table  3 

Interflow  Table  of  Volume  Figures  Reckoned  under  Base  Year  Prices  and  Base 
Year  Residual  Elements 


Receiving  sector 
No. 

Final 
deliveries 

Total 
deliveries 

h  =  1       2 

Delivering 
sector  No. 

k  =  1 

2 

0           X12 
X21       0 

X2# 

^1 

Primary  input 

Labour 
Non  compe- 
titive im- 
ports 

wx      w2 

B1         B, 

-(^1  +  ^2) 
~(B1+B2) 

0 
0 

Residual  input 

el                e2 

-(ei+£2) 

0 

Grand  total 

xx      x2 

0 

xx+x2 

The  introduction  of  the  volume  figures,  i.e.  the  magnitudes 
denoted  by  capital  letters  in  (13) — (15),  does  not  change  the 
number  of  degrees  of  freedom  because  to  each  new  magnitude 
corresponds  one  definitional  equation.  If  we  assume  that  in 
(8)  not  only  x±  and  x2,  but  also  sL  and  e2  are  changing  while  the 
factor  prices  nw  and  nh  as  well  as  the  6  physical  quantity  coef- 
ficients are  given,  we  have  4  degrees  of  freedom.  (In  n  sectors 
2n  degrees  of  freedom).  From  the  discussion  in  the  sequel  it  will 
appear  that  we  will  reserve  the  terminology  volume  figures,  as 
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defined  through  (13) — (15),  to  the  case  of  constant  residual 
rates.  These  constant  rates  we  can  assume  as  observed  by  the 
actual  situation  in  a  base  year.  Now  there  remain  only  two 
degrees  of  freedom  in  the  volume  figures.  They  may  be  unfolded 
say  by  Xx  and  X2.  (In  n  sectors  n  degrees  of  freedom). 

Table  3  has  at  the  same  time  the  following  two  properties: 
(1)  the  magnitudes  entering  into  it  have  the  character  of  volume 
figures  (because  they  represent  values  at  base  year  prices), 
and   (2)  vertical  summations  in  the  table  are  possible. 

I  shall  look  a  little  closer  into  the  particular  aspect  of  the 
question  that  is  represented  by  the  constancy  of  the  residual 
rates. 

C.  Constant  Factor  Prices  and  Constant  Residual  Rates  Entail 
Constant  Input-Output  Volume  Coefficients. 

We  define 
(16)  X'k&  =  ^ 

a?)  w'*  =  ir    b'»  =  y- 

From  the  definitions  (13) — (15)  follows  that  the  coefficients 
(16)— (17)  are  equal  to 

(18)  *»=-** 
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(i9)  *n=^x  B'h=^b'h 

where  the  7tk  are  given  by  (12),  and  nw,  nb  are  the  given  factor 
prices.  If  all  the  production  coefficients  x'12)  x'21,  w'v  w2  etc. 
reckoned  in  strictly  physical  quantities  are  constant,  we  see  from 
(10)  and  (16) — (17)  that  constant  factor  prices  and  constant 
residual  rates  sk  entail  constant  Xkh,  W'h  and  B'h. 

The  conditions  of  constant  residual  rates  ex  and  s2  can  be 
transformed  into  corresponding  conditions  about  the  residual 
rates  expressed  as  fractions  of  Xx  and  X2.  Indeed,  from  (6) 
and  (13),  we  get 
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(20)  ek  =  e'kXk 

where 

(2i)  4  =  ii. 

The  last  formula  taken  in  conjunction  with  (10)  shows  im- 
mediately that  if  we  have  constant  factor  prices  and  constant 
quantity  coefficients,  constant  ek  rates  will  entail  constant  s'k 
rates. 

On  the  other  hand,  if  the  marginal  rates  sk  are  given,  instead 
of  the  ek,  we  deduce  from   (9)  by  inserting  from   (21) 

7^(1  -  e'J  -  jz2x'21  =  nww[  +  n^b'x 
—n^x'^  +  n2{\  —  e'2)  =  7tww2  +  iibb2. 

From  the  equations  (22)  the  nk  are  determined.  That  is  to  say: 
Constant  coefficients  in  the  strictly  physical  structure,  constant 
factor  prices  and  constant  residual  rates  sk  entail  constant 
prices  nk  and  hence  by  (21)  constant  sk. 

It  is  the  same  set  of  prices  nlt  n2  that  is  determined  from  (9) 
and  (22)  only  the  data  are  taken  in  a  slightly  different  form. 
The  generalization  to  n  sectors  is  obvious. 

This  means  that  if  the  residual  rates  s[  and  s2  as  defined  by 
(20)  are  constant,  we  can  reason  about  the  volume  figures  in 
Table  3  very  much  in  the  same  way  as  we  can  about  the  strictly 
physical  quantities. 

To  be  more  precise:  In  Table  3  there  are  12  variables  connected 
by  the  12  equations 

f23)  *i  =  X21  +  W,  +  Bx  +  £l  =  X12  +  Xlx 

X2  =  X12  +  W2  +  B2  +  s2  =  X21  +  X2x 

and  (16)— (17)  and  (20)  where  the  coefficients  Xkh>  W'h,  B'h 
and  s'h  are  constants  (and  hence  may  be  determined  by  observing 
the  content  of  Table  3  in  a  base  year).  The  first  two  equations 
in  (23)  reduce  to  conditions  on  the  coefficients.  Hence  two 
degrees  of  freedom  in  the  variables.  This  checks  with  the  remarks 
in  connection  with  Table  3. 

The  two  degrees  of  freedom  that  remain  in  Table  3  under  the 
conditions  specified  —  constant  physical  coefficients,  constant 
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factor  prices  and  constant  residual  rates  —  may  be  generated 
by  letting  Xt  and  X2  vary.  Or  we  may  use  X^  and  X2*  as  basis 
variables  and  use  other  equations  to  express  X±  and  X2. 

D.  Constant  Coefficients  in  the  Volume  Sense  Entail  Constant 
Residual  Rates 

If  we  assume  a  model  of  Table  3  where  (16) — (17)  hold 
with  constant  coefficients,  we  can  conclude  that  the  residual 
rates  e'k  defined  by  (20)  are  constants. 

Indeed,  introducing  into  the  left  hand  equations  in  (23)  — 
which  follow  from  the  balancing  principles  of  Table  3  —  the 
expressions  for  Xk,  Xkh,  Wh,  Bh  from  (16) — (17),  we  get 

m)  4=1-  (X21  +  W[  +  B[) 

[     }  4=1-  (*Ii  +  K  +  B'%). 

Hence:  If  X'kh,  W'h,  B'n  are  constants,  e[  and  s'2  must  also  be 
constants.  We  are  thus  back  to  the  same  type  of  analysis  as  was 
discussed  under  subsection  C. 

Having  reduced  in  this  way  the  whole  formulation  to  the  figures 
contained  in  Table  3,  we  may  drop  the  assumption  of  an  under- 
lying strictly  physical  structure  which  we  started  from,  and 
simply  reason  about  the  figures  of  Table  3  as  value  figures 
reckoned  at  base  year  prices.  This  formulation  will  apply  even 
though  there  is  a  great  variety  of  individual  goods  that  enter 
into  each  aggregate  Xk  or  Xkh  etc.  For  all  practical  purposes  these 
figures  could  be  interpreted  as  volume  indices.  And  it  would 
seem  plausible  in  many  cases  to  make  the  assumption  of  constant 
input-output  coefficients  reckoned  in  such  figures. 

If  we  take  the  volume  figures  as  the  basis  of  the  analysis,  the 
product  prices  tz1}  ti2  become  indetermined,  and  the  same  is  true 
of  the  factor  prices  tzw,  nh.  Indeed,  if  in  (22)  we  insert  for  xkh 
from  (18),  and  similarly  use  (19)  we  simply  get  back  to  (24). 
The  product  prices  and  factor  prices  can  now  simply  be  looked 
upon  as  conventional  multipliers  by  which  we  define  "the 
strictly  physical  quantities"  in  (13) — (15).  If  the  "strictly 
physical  quantities"  are  well  defined  and  observable,  we  can, 
of  course,  deduce  the  factor  prices  nw,  nh  and  product  prices 
n1}  7i2  that  must  prevail  in  order  that  we  shall  get  the  observed 
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volume  figures  (in  base  prices  year  prices)  Xv  X2,  Wlf  W2  etc. 

E.  The  Aggregate  Residual  Rates 

The  residual  rates  e'h  express  the  input  of  residual  substance 
that  is  made  directly  into  sector  h,  reckoned  per  unit  of  total 
output  Xh  from  sector  h.  We  can  also  consider  the  aggregate 
residual  rates  sh  defined  by 

(25)  sh=  that  part  of  Xh  which  is  due  to  the  input  of  residual 

substance  in  any  sector,  assuming  that  all  residual 
substance  is  everywhere  passed  on  to  other  sectors  or 
to  final  output  in  the  same  proportion  as  the  volume  of 
cross  deliveries  or  the  final  deliveries.  In  other  words 
all  units  of  output  from  a  given  sector  contains  the  same 
amount  of  residual  substance. 
When  the  aggregate  residual  rates  are  defined  in  this  way, 
they  must  satisfy  the  equations 

(26)  £]  ~  *f«  2  1 

The  first  equation  in  (26)  is  obtained  by  noticing  that  the 
total  outflow  of  residual  substance  from  sector  1  is  e'1X1.  This 
must  be  equal  to  the  total  inflow  of  residual  substance  into 
sector  1,  namely  the  residual  substance  entered  directly  into 
sector  1  —  this  is  e^X-^  —  plus  the  residual  substance  that  is 
entered  into  sector  1  through  X21  —  this  is  e'2X21  — .  Dividing 
this  equality  by  Xv  we  get  the  first  equation  (26).  Similarly 
for  the  second  equation  in  (26). 

The  inputs  of  labour  Wh  and  imports  Bh  are  not  to  be  entered 
in  the  above  account  as  they  are  by  definition  not  residual 
elements.  But  we  could  have  singled  out,  say  Wk,  and  considered 
the  direct  coefficient  W'h  as  distinct  from  the  aggregate  coefficient 
W'h.  The  reasoning  would  be  the  same  as  in  (26). 

The  generalization  of  (26)  to  n  sectors  is  obvious,  namely 

(27)  24(<5 -*')*»  =  4         (h=l,2,...n). 

ft  =  l 

The  solution  of  this  is 
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(28)  «;  =  |  e'h(d  -  X')j*     (k=l,2,...  n). 

If  we  take  the  volume  figure  coefficients  X'kh  etc.  as  data, 
the  meaning  of  the  matrix  in  (27)  is  clear.  If  on  the  other  hand 
we  go  back  for  a  moment  to  the  interpretation  in  terms  of  strictly 
physical  quantities  and  with  constant  physical  coefficients  and 
constant  factor  prices,  we  must  remember  that  the  volume 
figure  coefficients  X'kh  in  (27)  depend  on  the  e'h.  The  volume 
figure  coefficients  Xkh  will  indeed  in  this  case  have  to  be  looked 
upon  as  determined  by  (18)  where  the  nk  are  given  by  (22), 
and  hence  depend  on  the  s'h.  This  means  that  if  we  fall  back  on 
the  constancy  of  strictly  physical  coefficients,  we  cannot  deter- 
mine the  sk  for  different  e'h  by  retaining  the  left  member  matrix 
in  (27)  and  just  changing  the  right  member  vector  en.  Both 
the  matrix  and  the  vector  will  have  to  be  changed.  On  the  other 
hand,  if  volume  figure  coefficients  are  taken  as  given,  we  cannot 
change  the  sh  but  must  let  these  magnitudes  be  determined  by 
(24).  The  equations  (26)  have  therefore  no  use  for  determining 
the  residual  rates.  The  only  purpose  of  the  equations  is  to  pass 
from  the  direct  rates  eh  to  the  aggregate  rates  eh  or  vice  versa. 

It  is  a  fundamental  proposition  in  input-output  theory  that 
equations  of  the  form  (26)  will  have  non  negative  solutions 
sh  if  the  sn  are  non  negative. 

If  we  multiply  the  first  equations  in  (26)  by  Xx  and  the  second 
by  X2  and  add  the  equations,  we  get,  using  the  equality  between 
the  left  and  right  members  of  (23) 

(29a)  ^i^i*  +  £2X2*  =  ei  +  H- 

That  is  to  say  the  total  residual  substance  contained  in  the  final 
delivery  is  equal  to  the  total  residual  substance  put  into  the 
system. 

Since  the  eh  are  non  negative  (and  at  least  one  of  them  positive 
if  at  least  one  of  the  sh  are  positive),  we  see  that  the  total  residual 
substance  contained  in  the  sectors  products  must  be  larger  than 
the  total  residual  substance  put  into  the  system,  i.e. 

(29b)  e\X1  +  e\ X2  >  e±  +  £2. 

This  double  counting  which  prima  facia  appears  a  little  puzzling, 
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is  easily  explained:  The  global  product  X±  -f-  X2  has  itself 
emerged  after  some  double  counting.  In  Xx  -f  X2  is  indeed 
included  not  only  the  total  primary  and  direct  residual  input, 
but  also  all  crossdeliveries.  This  follows  by  taking  the  sum  of 
the  left  hand  equations  in  (23),  which  gives 

(30)    Xx+X%  =  (W1+WM)  +  {B1+ti%)  +  (*i  +  h)  +  (Xij+Xti 

That  is  to  say  the  sum  of  all  sector  products  will  increase  if  we 
split  the  sectors  further  up. 

The  double  counting  is  only  in  the  total  sector  products,  not 
in  the  aggregate  residual  rates  sv  s2  as  is  seen  from  (29a). 

For  clear  thinking  in  the  variety  of  situations  that  arise  ac- 
cording to  the  various  systems  of  assumptions  adopted  it  is 
essential  to  be  very  careful  in  the  notation.  It  is  indeed  safe  to 
be  so  explicit  as  nearly  to  appear  pedantic. 

We  will  from  now  on  let  the  symbols  used  in  subsections  C  and 
D,  i.e.  Xk,  Xkh>  X'kh  etc.  and  the  corresponding  coefficients  be 
strictly  interpreted  as  the  volume  figures  and  volume  figure 
coefficients,  that  appear  when  the  residual  rates  e'n  are  constants 
and  have  a  specific  set  of  values. 

These  volume  figures  themselves  are  recorded  in  Table  3  and 
the  corresponding  coefficients  are  defined  in  (16) — (17).  With 
given  and  constant  coefficients  the  degrees  of  freedom  in  this 
model  is,  as  already  stated,  equal  to  n,  the  number  of  sections, 
i.e.,  in  Table  3  it  is  equal  to  2. 

F.   The  Structure  in  Semi-Volume  Figures 

An  essentially  new  situation  arises  if  we  drop  the  assumption 
of  constant  residual  rates.  We  can  discuss  this  situation  by  going 
back  to  the  structure  expressed  in  strictly  physical  terms.  We 
assume  constant  technical  coefficients  in  this  physical  structure 
and  also  constant  factor  prices,  but  the  direct  residual  rates  may 
now  be  changing.  And  as  they  change,  they  will  produce  changing 
product  prices  and  hence  changing  value  figures.  These  value 
figures  we  will  term  the  semi-volume  figures.  In  this  way  of 
thinking  there  are  2n  degrees  of  freedom,  represented,  say,  by 
the  n  physical  quantities  and  the  n  residual  rates. 

Instead  of  discussing  the  semi-volume  structure  by  the  help 
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of  the  strictly  physical  quantities,  we  can  also  do  it  through 
the  residual  rates  and  the  volume  figures  as  they  were  defined 
under  subheadings  B — E,  see  in  particular  the  comments  in 
the  last  part  of  subsection  E. 

All  the  semi- volume  magnitudes  will  be  denoted  by  the  super- 
script sem  (standing  for  semi-volume). 

The  new  situation  will  be  described  by  a  table  similar  to  Table  2 
namely  by  Table  4  and  it  is  through  the  balancing  equations  of 
this  new  table  that  the  product  price  concept  gets  a  meaning. 

Using  an  interpretation  in  terms  of  the  strictly  physical 
structure,  we  are  particularly  interested  in  the  connection  between 
the  <m  and  ef m  and  £s2em. 

We  put  up  the  following  definitions,  which  are  similar  to 
(13)-  (15). 

(31)  Xk     =  nk    xk         Xk:¥   —  7ik    xk% 

(32)  XIT  =  nrxkh 

(33)  w*r  =  <em ™n    £?*m  =  <*m  K 

The  semi- volume  figures  Xkm,  Xk™  etc.  measure  the  production 
levels,  the  cross  deliveries  etc.  when  the  residual  rates  are 
chosen  as  skem  instead  of  the  e'k  that  are  associated  with  the 
measurement  of  the  volume  figures  Xk,  Xkh  etc. 

In  general  we  will  assume 

(34)  7lS*m  —  7tw  7cfm  =  7lb 

but  for  the  symmetry  of  the  formulae  we  may  retain  the  notation 
7is*m  and  j^em.  Instead  of  Table  3  we  now  get  Table  4. 

It  should  be  understood  that  Tables  2,  3  and  4  —  as  well  as 
a  table  similar  to  Table  2  with  sem  added  as  superscript  on  the 
n  and  s  —  exist  at  the  same  time. 

In  the  complete  system  now  considered  we  again  have  2n 
degrees  of  freedom  which  may  be  unfolded  by,  say,  the  Xk 
and  the  skem.  In  the  case  of  2  sectors,  there  will  be  4  degrees 
of  freedom.  In  the  strictly  physical  system  we  also  had  4  degrees 
of  freedom,  when  the  factor  prices  jiw  and  nb  as  well  as  the  6 
production  coefficients  in  (8)  were  given. 

If  we  lean  on  the  interpretation  in  strictly  physical  quantities, 
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it  is  easy  to  indicate  what  the  semi-volume  figures  will  be  in 
terms  of  the  volume  figures  Xk,  Xkh  etc. 

Table  4 

Interflow  Table  of  Semi-Volume  Figures  Reckoned  under  the  Prices  that  Prevail 
when  Factor  Prices  are  Constant  and  Residual  Rates  are  Arbitrarily  Given 


Receiving  sector 
No. 

Final 
deliveries 

Total 

h  =  1 

2 

deliveries 

Delivering 
sector  No. 

k  =  1 

2 

0 
A21 

vsem 
A12 

0 

vsem 

vsem 
^2* 

■y-sem 

Al 

v-sem 
A2 

Primary- 
input 

Labour 
Non  compe- 
titive im- 
ports 

Wl 

7-,sem 
Bl 

B2 

-(Bfm+Bs°m) 

0 
0 

Residual  input 

sem 

sem 
e2 

,  sem  ,     sem,. 
-  (e1     +e2     ) 

0 

Grand  total 

xfm 

^sem 
X2 

0 

Indeed,  adding  the  superscript  sem  for  the  price  elements  in 
(22)  (except  for  the  factor  prices,  which  are  the  same)  and  using 
(31)— (33),  we  get 


(35) 


7l\ 


—n\ 


1 


'sem 


•^2      *^21  —  ^"vo  ^1 


KbK 


'12 


_sem  ( i 
7l2      (1 


2Sem)   =KWW2  ^^bK 


where 
(36) 
Similarly  we  get 


(37) 

where 
(38) 


7i; 


—  717     X 


'21 


12 


Tie, 


"V'sem 


nm  w. 


7l„,  W, 


nbK  +  £] 
nb  K  +  i? 


i.e. 
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Jsem 

(39)  4sem  =  -i- . 

nk 

We  may  look  upon  the  two  sets  of  prices  nk  and  n\* m  simply 
as  special  values  assumed  by  the  product  price  functions  for 
different  values  of  the  residual  rates  considered  as  arguments 
in  these  functions. 

Through  (13)— (15)  and  (31) — (34)  we  get,  remembering 
that  the  strictly  physical  quantities  xk,  xhk  etc.  are  independent 
of  how  residual  rates  are  chosen 


(40) 
(41) 
(42) 


(43) 
(44) 
(45) 
where 

(46) 


Vsem          ^sem 
Ak       _  nk 

Y"sem         ^sem 
Akx           nk 

Xk             7lk 

Xkx             7lk 

ysem         ^sem 
Akh           nk 

Xkh            7lk 

wiem 

—    1 

psem 

Bh 

w, 

say,  we  have 

xr  =  PTX*  x**  =  Ptmx^ 

^S"  =  Pfxm 

Wfm  =Wh        B 

r  =  b„ 

PT- 

=<m 

7li 


the  pkm  are  index  numbers  of  prices,  with  the  base  situation 
chosen  as  the  situation  in  relation  to  which  the  volume  figures 
are  defined.  These  index  numbers  have  a  meaning  even  if  no 
strictly  physical  quantities  are  defined.  For  pkm  =  1  the  semi- 
volume  figures  are  equal  to  the  volume  figures. 

The  number  2n  of  degrees  of  freedom  is  not  changed  by  in- 
troducing the  semi-volume  figures  through  (31) — (34)  or  by 
introducing  the  price  indices  through  (46).  Indeed,  to  each 
new  magnitude  introduced  corresponds  a  definitional  equation. 

To  unfold  the  In  degrees  of  freedom,  we  may  use  the  volume 
figures  Xk  and  the  residual  rates  skem  defined  by  (36)  (the  s'k  are 
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fixed  as  mentioned  above,  see  in  particular  the  comments  to 
(13) — (15)).  Instead  we  may  use  as  basis  variables  the  Xk  and 
the  pskem.  Or  the  Xskem  and  the  Xk.  Or  some  other  linearly  in- 
dependent set  of  2n  of  the  variables  entering  into  the  complete 
set  up. 

From  (35)  we  get  by  (46),   (18)  and  (19) 

(47)  #em(l  -  %sem)  -  tTKi  =  K '+  B[ 

-  PimK2  +  £s2em(i  -  4sem)  =  K  +  K 

The  coefficient  X'12>  X'21  etc.  in  (47)  are  determined  by  (16)— 
(17)  applied  in  the  base  year  situation.  The  pkm  are  therefore 
well  defined  as  functions  of  the  skem.  The  generalization  to  n 
sectors  is  obvious. 

We  could  also  have  considered  the  semi-volume  residual  rates 
in  the  form 

gsem 

(48)  er  =  -~- 

By  (36)  and  (40)  this  is  the  same  as 

(49)  g*r  =  fire,1r. 

With  the  e*em  given  (47)  takes  the  form 

(50)  p™  -  rrKi  =  K  +  s;  +  ^em 

-PtTX'u  +  fr  =  K  +  K  +  *7m 

Note  the  analogy  —  and  also  the  difference  —  between  the 
equations  (47)  and  (50)  on  one  hand  and  on  the  other  the 
equations  (22)  and  (9),  and  also  the  equations  (35)  and  (37). 

For  ef m  =  4  we  should  by  (46)  §et  ps™  =  p%m  =  1.  That 
this  is  in  fact  so,  is  seen  by  inserting  these  values  for  the  pkm 
and  comparing  with  (24). 

The  solution  of  (50)  is 


sem 


(51)      ft  1  -  X'aX'u 

(W[  X'12  +W'2)  +  (B[  X'u  +  B'2)  +  (ir  X'xi  +  «J 


Km  = 


^12^21 


Again  the  generalization  to  n  sectors  is  obvious.  Instead  of 
(51)  we  get 
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(52)  i  firV  -  X'U  =W'h  +  B'h  +  «T     (h=l,2...n). 
The  solution  of  this  is 

(53)  fiT  =  1  (W'h  +  B'h  +  Cm)  (»  -  X')ll     \k  =  1,  2  .  .  .  «)'. 

The  last  formula  suggests  immediately  the  following  three 
component  parts  of  the  price  p 


sem 


PFjj.  =  ^  W&(^— ^')wb  due  to  labour  input  anywhere  in 
h=1  the  system. 

n 

(54)  £'&  =  J  ^a(^—^')m   due  to  imports  anywhere  in  the 

h=1  system. 

n 

8Tm  —  2  Ghm(d—X')hk  due  to  residual  input  anywhere 
71=1  in  the  system. 

The  last  expression  in  (54)  is  aggregate  residual  substance  in  Xk 
reckoned  per  unit  of  Xk.  It  satisfies  the  equations 

(55)  £;sem  -  e£emX'21  =  slem 
P'sem  v'sem  4-  p's 

fcl        ^12  i     fc2 


•sem 


where  as  before  £|em  is  direct  residual  input  reckoned  per  unit 
of  Xk.  These  equations  are  analogous  to  (26).  In  both  cases  the 
residual  substance  is  reckoned  per  unit  of  the  sector  product 
measured  in  volume  figures.  In  n  sectors  (55)  is  written 

(56)  |  eTm(d  -  X')kh  =  er     (h  =  1,  2  ...  n). 
fc=i 

G.  The  Semi-Volume  Coefficients  and  a  Modified  Definition  of 
the  Sector  Products 

In  analogy  with    (16) — (17)   let  semi- volume  coefficients  be 
defined  by 

(57)  X* 


sem 


kh 


&ft              vsem 
/kq\  TT/'sem  ft  p'sem  ft 
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Note  in  this  connection  (42). 

Inserting  from  (43) — (44)  into  (57) — (58),  we  get 

Asem  ^sem  V" 

/kq\  v'sem  "  &       y'       •  _       y'sem  rk  kh 

(59)  Am     =  — -Akh  i.e.   Akh     = 

Ph  Ph   Ah 

(60)  T^Isem  =  —  W'  and  B'sem  -  —  B'. 

\        I  n  /Asem         "'  ""  /Asem 

This  shows  that  if  the  volume  coefficients  Xkh  are  constant, 
the  semi-volume  coefficients  cannot  be  constants  under  changes 
in  residual  rates,  because,  such  changes  will  by  (51)  make  the 
price  indices  pskem  change.  There  would,  of  course,  be  no  logical 
inconsistency  in  assuming  the  semi- volume  coefficients  constant, 
but  then  the  volume  coefficients  would  change  under  the  changes 
in  residual  rates. 

The  real  question  at  issue  is  to  know  which  is  the  most 
realistic  assumption. 

If  we  assume  such  a  market  organization  and  such  a  techno- 
logical structure  that  an  increase  in  the  price  of  a  product  will 
cause  an  equally  large  relative  decline  in  its  use  for  cross  delivery 
as  well  as  for  total  delivery,  then  the  semi-volume  coefficient 
would  be  constant  while  the  volume  coefficient  would  change. 
This  is  seen  from  (59)— (60).1) 

But  if  we  can  assume  fixed  coefficients  in  the  strictly  physical 
structure,  then  the  volume  coefficients  must  be  constant.  In 
what  follows  I  will  assume  constant  volume  coefficients. 

Then  a  second  question  arises:  Can  we  modify  the  definition 
of  sector  product  in  such  a  way  as  to  compensate  for  the  varia- 
bility in  semi-volume  coefficients? 

An  obvious  answer  is  that  if  the  volume  coefficients  Xkh  are 
known  and  also  the  residual  rates  —  either  in  the  form  e^™  or 
in  the  form  ekm  —  the  price  indices  pskem  will  follow  by  (47)  or 
(50).  Hence  we  can  always  by  (43)- — (44)  compute ' 'compensated' ' 
variables  —  namely  the  Xk  and  the  Xkh  < —  which  are  such  that 
they  will  be  connected  by  the  constant  volume  coefficients. 
This  procedure  is  however  highly  non  linear  and  it  does  not  seem 

l)  How  realistic  such  a  case  would  be,  is  another  question. 
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very  promising  to  proceed  to  a  study  of  the  semi-volume 
variables  along  such  lines.2) 

Starting  from  the  concepts  of  semi-volume  figures  it  is,  however, 
possible  to  introduce  certain  modified  definitions  of  sector 
products  and  cross  deliveries  which  are  such  that  they  are 
approximately  related  through  the  constant  volume  coefficients. 

To  arrive  at  such  a  formulation  we  will  first  rewrite  the 
expressions  (51)  in  the  forms 

._    ,  ,  (r-^+cr-^ 

Pi      =H i Y>    y 

,61*  l   —  ^12^21 

y*     —  L  i 


-sem 
'2 


1  —  x12x21 

The  first  of  these  equations  follows  by  writing  the  numerator 
in  the  first  equation  of  (51)  in  the  form 

(62)  (Wi+Bi+e[)+X'n(Pr'%+B'%+4) 

+(«r-80+xi«--«;). 

The  first  and  second  parenthesis  here  are  respectively  (1—  X'21) 
and  (1  —  X'12)  by  (24).  This  part  of  (62)  therefore  becomes 
(1  -  X'21)  +  X'21{1  -  X'12)  =  1  -  X'12X2V  This  establishes  the 
first  equation  in  (61).  Similarly  for  the  second  equation  in  (61). 

As  a  check  on  (61)  we  see  that  pfm  and  ps2em  reduce  to  1  if 
ef  m==  E[  and  e|em  =  e2. 

In  the  case  of  n  sectors,  we  have 

(63)  pr  =  1  +  i  W  -  <)  («  -  X')ll     (k=l,2...n). 

In  the  regular  case  the  coefficient  of  (elem  —  s^)  in  the  first 
equation  of  (61),  namely  1/1  —  X'12X21  (in  general:  the  diagonal 
element  (S  —  X')j£)  will  be  sUghtly  above  unity,  while  the 
coefficient  of  (e2m  —  e2)  in  the  first  equation  of  (61)  will  be 
small  since  it  is  multiplied  by  the  coefficient  X21,  and  may 
therefore  be  neglected  in  a  first  approximation.  Similarly  in 
the  second  equation  in  (61).  That  is,  we  have 

(64)  ^em  =  l+^em_£;;     (approximately)     (k  =  1,  2  . . .  n). 

2)  The  prices  are  by  (50)  linear  in  the  e  |em,  but  by  (47)  non  linear  in  the 
gjBem.  In  any  case  the  deflation  by  the  prices  makes  the  set  up  non  linear. 
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In  other  words,  as  a  first  approximation  the  price  pkm  depends 
only  on  the  residual  rate  sk  and  not  on  the  other  residual  rates. 
And  the  relation  is  a  simple  addition.  The  "dependency"  we 
speak  of  now  is  an  (approximate)  accounting  dependency  which 
hold  good  regardless  of  behaviouristic  relations.1) 

Multiplying  (68)  by  Xk,  we  get 

(65)     Xsr-er  =  Xk(l-e'k)    (approximately)  \k  =  1,  2/... n). 

The  input-output  coefficient  in  semi  volume  figures,  i.e. 
^L^em  as  defined  by  (57)  is  not  constant.  There  is,  as  is  seen  from 
the  left  hand  expression  in  (59)  a  correction  to  be  applied  in  the 
numerator  as  well  as  in  the  denominator  in  order  to  reach 
something  that  is  constant.  The  correction  in  the  denominator 
can  be  done  with  the  approximation  (65)  simply  by  using  the 
left  hand  expression  in  (65)  to  measure  the  sector  product 
instead  of  Xkem. 

We  will  first  consider  the  case  where  we  make  this  denominator 
correction  without  making  the  numerator  correction.  Is  this  a 
sound  procedure? 

By  analogy  consider  the  difference  (x±  —  x2)  between  two 
stochastic  variables.   The  variance   of  this  difference  will  be 


equal  to  var.  xx  +  var.  x2  —  2r\/var.  xt  •  var.  x2  where  r  is 
the  correlation  coefficient.  This  expression  is  larger  than  var.  x1 
if,  and  only  if  Vvar.  a;2/var.  x±  >  2r .  Therefore,  if  we  know 
that  var.  x2  is  appreciably  larger  than  var.  xv  it  will  pay  to 
correct  x2  ■ —  that  is  making  it  non  stochastic  —  even  if  we  do 
not  correct  xv  And  this  will  apply  regardless  of  the  nature  of 
the  correlation,  whether  positive  or  negative. 

In  our  case  the  question  is  if  we  shall  correct  for  ps£m  in  the 
denominator  of  the  expression  to  the  right  in  (59)  even  if  we  do 
not  correct  for  pskm  in  the  numerator.  We  know  that  a  change 
in  p*m  will  produce  a  change  in  pkm in  the  same  direction  (positive 
correlation),  but  the  change  in  pkm  will  be  proportionally  much 
smaller  if  there  are  many  highly  intertwined  sectors.  Hence  we 

x)  It  is  the  equation  itself,  i.e.  (64)  —  or  more  exactly  (61)  —  which  has 
accounting  character.  This,  of  course,  does  not  prevent  one  or  more  of  the 
variables  from  entering  into  some  other  relations  that  are  behaviouristic.  The 
expressions  "accounting"  vs.  "behaviouristic"  can  be  used  about  a  relation, 
not  about  a  variable. 


PRICES   VS.    COSTS    AND    PRODUCTION    COEFFICIENTS  211 

ought  to  get  a  more  correct  result  by  correcting  for  fis*m  even 
if  we  do  not  do  it  for  ^em. 

The  above  argument  is  particularly  adapted  to  the  case  where 
there  is  a  change  in  the  residual  rate  in  a  single  sector.  To  some 
extent  a  similar  reasoning  can  be  applied  in  succession  to  any 
of  the  sectors.  In  each  step  the  correction  contemplated  will  be 
better  than  nothing.  But  it  is  quite  clear  that  if  all  residuals 
change  simultaneously,  there  may  occur  cases  where  it  would 
have  been  better  to  make  no  corrections  at  all  in  the  variables. 

For  instance  if  all  4sem  are  equal  —  i.e.  4sem  independent  of 
k  • —  we  see  from  (72)  that  we  obtain  a  better  approximation 
by  not  making  any  corrections  on  the  variables,  because  in  this 
case  Xsm  is  a  constant  times  Xs£m.  The  constant  is  equal  to 
(1  —  4/1  —  e'h)X'kh. 

On  the  other  hand  if  (s^em  —  e^)  is  independent  of  k,  and  hence 
by  (64),  pfm  independent  of  k,  we  see  from  (57)— (59)  that 
Xf™  is  again  a  constant  times  Xh,  but  now  the  constant  is  simply 

XL- 

These  cases  where  the  semi  volumes  figures  themselves  are 

connected  by  constant  coefficients  are,  however,  very  special. 

They  resemble  the  case  where  the  residual  rates  are  constant 

and  we  get  the  volume  figures. 

If  we  want  an  approximation  that  holds  —  at  least  roughly 
—  for  any  changes  in  residual  rates  —  in  particular  for  changes 
with  a  small  covariance  between  the  individual  residual  rates  — 
the  correction  of  the  denominator  to  the  right  in  (59)  —  which 
leads  to  (68)  —  seems  to  be  a  workable  formula. 

The  correction  for  ffim  can  be  achieved  simply  by  starting 
from  the  exact  relation 

(66)  X£T  =  prX'aXh 

and  introducing  here  the  expression  for  Xh  taken  from  (65). 
This  gives 


(67)  ZJT  =  P 


sem 


(approximately). 


Dropping  at  this  stage  the  correction  pfm,  we  can  write 
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(68)  I»=fe]Km-£»m)     (approximately). 

The  expression  in  bracket  is  a  constant  and  can  be  determined 
from  the  data  in  the  base  year.  (If  the  sector  product  has  been 
defined  as  (Xh  —  sh)  already  in  the  base  year  where  the  coef- 
ficients were  determined,  the  value  of  the  bracket  will  emerge 
directly.) 

This  procedure,  while  rough  has  the  great  advantage  that  it 
keeps  the  model  linear,  and  it  will  as  a  rule  —  compare  the 
discussion  above  —  at  least  be  better  than  simply  to  put 

(69)  Xf™  =  X'khXr     (incorrect) 

in  a  case  where  the  residual  rates  do  not  remain  constant. 

It  is  possible  to  make  the  first  order  correction  also  for  the 
factor  pk  in  the  numerator  to  the  right  in  (59)  but  then  the 
model  does  not  remain  linear.  Indeed  we  have 


(70) 

tr  = 

xr 

xk 

(exactly) . 

Introducing 

here  for  X, 

b  from 

(65),  we  get 

(71) 

Vk       —   v 

■v-sem 
sem 

sem  t1           £k) 

And  inserting  this  in  (67),  we  get J) 
(72)         X~  = 


Xr        HI  -  e'k)X'kh 
L      1  -  el      . 


\rseia 

^k       ~ '  °&       "-        x   ~  ch 


(approximately) . 

/  ysem       0sem  \ 

(approximately). 


H.  The  Formulation  in  Non-Residual  Cost 

Let  us  introduce  the  aggregate  non-residual  fart  of  the  price 
•of  the  output  from  sector  k.  This  is  that  part  of  pskem  which  is 
due  to  the  input  of  primary  factors  (in  Table  4)  labour  and 
imports).  This  part  of  pkem  is  given  as  the  first  two  terms  in 
(54)  which  —  in  the  case  n  —  2  —  are  given  by  the  terms  in 

x)  Dividing  bythe  first  fraction  in  (72),  the  left  member  becomes  X|«m(l  -e*sem) 
If  this  is  taken  as  definition  of  a  corrected  cross  delivery,  we  get  a  relation  with 
constant  coefficient.  But  this  relation  is  not  linear  in  Xl\m,  Xlem  and  e|em. 


xr  =  prxk 

ynori          ^nori  v 
A&*     —  Pk      Aft* 

XT  =  PT"Xkh 

w™  =  wh 

BTri  =  Bh. 
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(51)  that  do  not  depend  on  £fm  and  £2sem. 
We  denote  this  part 

(73)     pr  =  I(W'h+  B'h)  (d  -  X%i     {k  =  1,  2  .  .  .  «). 

fc  =  l 

As  long  as  the  production  coefficients  reckoned  in  volume 
figures  are  constant,  the  prices  (73)  are  constant.  The  correspond- 
ing non  residual  parts  of  the  sector  products,  cross  deliveries 
and  final  deliveries  are 

(74) 

(75) 

(76) 

Since  these  non  residual  parts  of  the  volume  figures  are  simply 
proportional  to  the  volume  figures,  nothing  is  gained  by  working 
with  these  variables  instead  of  the  volume  figures.  Both  sets  of 
variables  will  exactly  satisfy  relations  with  constant  coefficients, 
provided  the  volume  structure  coefficients  are  constant.  No 
further  attention  will  therefore  be  paid  to  the  structure  in  non 
residual  parts. 

I.  Formulation  in  Factor  Costs 

Finally  we  will  consider  a  breakdown  of  4em  m  the  two  parts 

(77)  4em  =  <5r  +  7-»  (k  =  1,  2.   .  .  .  n) 

where  dskem  stands  for  profits  and  7|em  for  taxes.  More  specifically 
we  may  interpret  dskm  as  profits  before  the  deduction  of  direct 
taxes,  so  that  Tkm  will  stand  for  indirect  taxes. 

We  have  now  n  more  degrees  of  freedom,  i.e.  Sn  degrees 
altogether.  They  may  be  represented,  say,  by  the  Xk,  and  the 
rates  e|em  and  T sfcem,  where  efm  is  defined  by  (48)  and 

ssem  T-sem 

/rrQ\  "ssem  *  ffisem.  & 

V78)  Ok       ———  lk       —  — — 

Ak  Ak 

so  that 


sem 


Assuming  that  the  coefficients  in  the  volume  structure  are 
constant,  we  know  that  if  the  £|em  are  changed,  the  pkm  must 
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follow  in  the  way  previously  discussed.  This,  however,  does  not 
say  anything  about  the  way  in  which  the  prices  will  change  if 
the  Tskem  change.  Conceivably  any  change  in  the  Tskem  may  be 
compensated  by  opposite  changes  in  the  dkm  so  that  the  sskem 
remain  constant  and  hence  the  pfm  constant.  Or  a  smaller  or 
larger  part  of  the  change  in  Tkm  may  be  absorbed  in  the  prices. 
In  a  market  of  a  more  or  less  conventional  sort  it  is  perhaps 
plausible  to  assume,  as  a  very  simple  case,  that  the  Tkm  will 
affect  the  prices  directly  and  fully  in  the  sense  that  we  have 

(79)  pskem  =  1  4-  T |em  -  T'k   (approximately)     (k  =  1,  2  .  .  .  n) 
where 

T 

(80)  T'k  =  — -  =  indirect  tax  rate  in  the  base  year. 

If  this  is  so,  we  get  by  a  reasoning  analogous  to  that  connected 
with   (64)— (65). 

(81)  XT  ~  Tsr  =  X*{1  ~  T'h)      (approximately) 

k=  1,2  .  .  .n). 

so  that  in  analogy  with  (71) — (72)  we  may  put 

(82)  X%?  =  [~r]  (XT  -  TT)     (approximately). 

The  expression  in  brackets  is  a  constant  to  which  we  may 
attach  comments  similar  to  those  connected  with  (68). 

The  set  up  (81)  has  been  used  in  the  Oslo  median  model.1) 

J.  Formulation  in  current  values 

If  we  consider  also  the  factor  prices  as  variables,  we  are  led 
to  the  concepts  X™  =  pkXk,  Z^r  =  pkXkh,  etc.  Note  that 
pskm  stands  for  the  price  concepts  that  emerge  when  the  factor 
prices,  i.e.  the  price  concepts  for  Wk  and  Bk,  are  constant,  while 
pk  stand  for  the  corresponding  concepts  when  the  wage  rate  and 
the  import  prices  may  change.  I.e.  that  emerge  when  the  factor 
price  concept  is  expressed  by  a  general  wage  index  q  (with 

1)  The  Oslo  median  model  contained  several  specifications  that  go  beyond 
those  considered  here,  but  in  essence  we  can  say  that  what  is  here  denoted 
(X\«m  -  T*e m)   and  X**m  respectively,  was  there  denoted  Xh  and  Xm. 
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q  =  1  in  the  base  situation)  and  the  import  prices  are  arbitrarily 
given.  Correspondingly  the  X™1  and  the  X%j*  are  current  values 
as  they  emerge  when  applying  the  general  price  indices  pk. 
When  the  current  values  are  deflated,  we  get  back  to  the 
volume  figures.  For  any  individual  sector  product  this  deflation 
is  simply 

ycur  V"cur 

For  the  global  output  as  a  whole  we  have 

vcur     I      vcur 

(84)  defl.  {X™  +  Zcur^ 


2  ;       Pr.  ind.  (Xiur  +  ^D  ' 
If  we  use  a  Laspeyre  price  index,  (84)  is  further  reduced  to 

V"cur     i      Vour 

(85)  defl.  (X?  +  XT')  =  -    *    ~  -  *      =X,  +  X2. 

PlXl  +  p2X2 

"  x,  +  x2 

A  similar  reduction  takes  place  if  we  deflate  the  semi  volume 
figures.  That  is  to  say,  in  order  to  arrive  at  a  measurement  of 
the  global  output  that  is  independent  of  such  price  effects  as 
may  be  produced  by  residual  input  elements,  or  more  specifically 
that  part  of  these  elements  which  is  represented  by  indirect 
taxes,  it  is  not  necessary  to  use  measurements  as  (X^  —  e£ur) 
or  (X™T  —  T™T)  for  the  sector  products  (compare  by  analogy 
(65)  and  (81)).  We  can  use  the  total  value  Xfm  and  afterwards 
deflate. 

Another  aspect  of  this  question  is  that  such  differences  as 
(X™  -  4ur)  or  (X™  -  T™r)  only  represent  first  order  correc- 
tions. The  complete  correction  is  obtained  by  computing  the 
prices  pv  ft2  by  formulae  analogous  to  (51)  or  (53)  —  or  ob- 
serving them  —  and  then  using  these  prices  for  the  deflation 
process. 


LAWRENCE  R.  KLEIN 


The  Efficiency  of  Estimation  in  Econo- 
metric Models 


At  an  early  stage  in  the  development  of  mathematical  statistics 
and  in  its  application  to  economics  Professor  HoteUing  steered 
our  thinking  in  the  right  direction  towards  the  relationship 
between  problems  of  parameter  estimation  from  samples  and 
problems  of  prediction  outside  the  confines  of  the  sample.  The 
essence  of  the  matter  is  contained  in  his  1929  paper  written 
jointly  with  H.  Working.1)  A  more  concise  presentation  was 
given  later  in  Hotelling's  appraisal  of  a  research  publication 
in  psychology  and  sociology.2)  Hotelling's  formulation  of  the 
prediction  problem  will  long  serve  as  a  model  in  econometrics. 
In  following  up  recent  contributions  on  the  efficiency  of  alter- 
native methods  of  estimation  in  econometrics,  it  is  found  im- 
mediately useful  to  turn  to  Hotelling's  basic  formula  for  the 
standard  error  of  forecast. 

First  I  shall  take  up  some  problems  in  connection  with  the 
use  of  conventional  least-squares  methods  of  estimation  in 
econometrics  and  then  go  on  more  specifically  to  the  problems 
of  prediction  which  rely  so  heavily  on  Hotelhng's  inspirational 
work. 

A  Revival  of  Support  for  Least-Squares  Estimates 

Recent  arguments  in  favor  of  reversion  to  conventional  ap- 
plication of  the  method  of  least  squares  for  estimating  para- 
meters in  single  equations  in  simultaneous  systems  dealt  with  in 
econometrics  have  stressed  the  comparative  efficiency  of  this 

x)  H.  Hotelling  and  H.  Working,  "Applications  of  the  Theory  of  Error 
to  the  Interpretation  of  Trends,"  Journal  of  the  American  Statistical  Association, 
Vol.  24   (March  Supplement,   1929),  pp.  73—85. 

2)  H.  Hotelling,  "Problems  of  Prediction,"  The  American  Journal  of  Sociology, 
XLVIH   (1942),  61—76. 
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method.  For  example,  in  the  discussions  at  the  1954  meetings 
of  the  Econometric  Society  in  Uppsala,  H.  Theil  outlined  a 
theorem  showing  that  the  generalized  variance  of  least-squares 
estimates  of  the  parameters  in  a  single  equation  is  at  least  as 
small  as  that  of  limited-information-maximum-Hkehhood 
estimates.3)  It  has  long  been  suggested  that  it  may  be  rewarding 
to  gain  some  efficiency  at  the  expense  of  bias  by  using  the  least- 
squares  method,  but  the  formal  proof  of  the  superior  efficiency 
has  only  just  been  set  out  by  Theil.  Of  course,  we  must  know 
more  about  the  "utility  function"  of  the  users  of  estimated 
models  before  we  can  judge  about  the  relative  importance  of 
bias  and  efficiency. 

In  this  paper,  I  should  like  to  suggest  that  Their  s  results  and, 
in  fact,  much  of  the  discussion  on  the  relative  merits  of  different 
methods  is  misplaced.  In  systems  of  equations,  with  several 
parameters  in  each  equation,  it  is  misleading  to  look  at  in- 
dividual parameters,  one  by  one,  or  even  restricted  groups  of 
them  in  reaching  overall  judgments  about  the  importance  of 
bias  or  efficiency.  Users  of  econometric  models  are  often  not 
really  interested  in  particular  structural  parameters  by  them- 
selves. They  are  interested  in  the  solution  to  the  system,  under 
alternative  sets  of  conditions.  In  other,  more  technical,  words 
they  are  interested  in  the  reduced  forms  of  the  estimated 
system.4)  It  is  the  difference  between  partial  and  general  analysis 
that  is  involved.  It  is  conceivable  that  partial  analysis  is  an  end, 
in  itself,  for  some  problems  —  possibly  those  of  a  purely  peda- 
gogical nature  —  but  most  problems  call  for  a  more  complete 
analysis  of  the  system. 

The  transformation  of  a  structural  system  to  its  reduced  form 
can  be  associated  with  the  process  of  forecasting.  We  shall  use 


3)  "Report  of  the  Uppsala  Meeting,  August  2 — 4,  1954,"  Econometrica, 
Vol.  23,  (April,  1955),  pp.  204 — 5.  Limited  Information  and  other  methods  of 
estimation  in  econometric  models  are  described  in  various  publications.  A  good 
summary  is  found  in  Wm.  C.  Hood  and  T.  C.  Koopmans,  "The  Estimation  of 
Simultaneous  Linear  Economic  Relationships,"  Studies  in  Econometric  Method, 
(New  York:  John  Wiley  and  Sons,   1953). 

4)  Theil  points  out  to  me  that  for  treatment  of  structural  change  and  ap- 
praisal of  the  a  priori  "reasonableness"  of  parameter  estimates  one  would  be 
primarily  interested  in  the  individual  structural  estimates. 
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this  term  in  a  general  sense  in  this  paper,  that  is  in  the  sense 
of  making  estimates  of  endogenous  economic  magnitudes  outside 
the  realm  of  past  experience.  In  this  sense,  a  wide  variety  of 
problems  of  empirical  economic  analysis  are  forecasting  problems. 
Propositions  valid  for  partial  systems  may  not  carry  over  when 
complete  systems  are  studied.  Or  even  propositions  valid  for 
structural  parameters  may  not  be  valid  for  reduced  form  para- 
meters. A  beautiful  property  of  the  maximum-likelihood  method 
of  estimation  is  that  its  characteristics  are  preserved  under 
single- valued  transformation  of  variables.  Let  0  be  an  unknown 
parameter,  0  its  maximum-likelihood  estimate,  and  f\B)  a 
single- valued  transformation  function.  Then  it  follows  that  the 
maximum-likelihood  estimate  of  f{0)  is  given  by  f(0).  As  a 
result  of  this  proposition,  the  desirable  features  of  maximum 
likelihood  estimates  of  structural  parameters  remain  as  desirable 
features  when  the  parameters  and  estimates  are  transformed  into 
reduced  form  coefficients.  An  analogous  property  does  not  hold 
for  the  method  of  least  squares  in  general. 

Bias-Structural  Equations  and  Reduced  Forms 

First,  let  us  consider  the  question  of  bias.  By  comparing  one- 
by-one  the  coefficients  of  a  system  estimated  by  the  method  of 
least  squares  with  corresponding  coefficients  estimated  by  some 
unbiased  method,  investigators  sometimes  conclude  superficially 
that  the  amounts  of  bias  are  unimportant.  From  studies  of 
numerical  methods  of  solving  linear  equation  systems  with 
parameters  subject  to  error,  we  learn,  however,  that  small 
errors  in  coefficients  may  lead  to  sizeable  errors  in  the  final 
solution. 

Suppose  that  we  have  a  linear  equation  system 

(1)  Byt  +  rzt  =  ut,      y  jointly  dependent, 

z  predetermined, 

with  reduced  form 

(2)  yt=-B~irzt  +  B-iut. 

We  shall  denote  a  set  of  unbiased  or  consistent  estimates  as 

A. 

B  and  r.  Least-squares  estimates  will  then  be  written  as 
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B  +  D(B),  f+D{r). 

D{B)  and  D(r)  are  discrepancy  matrices  showing  how  the  least- 
squares  estimates  differ  from  the  set  B,  r.  For  a  given  zt- vector 
we  shall  then  be  interested  in  the  effect  of  the  discrepancies  on 
the  solution  vector. 

A  first  order  approximation  to  the  solution  of 

[B  +  D(B)M  +  D(yt)]  +  [f  +  D(r)]zt  =  0 

is  given  by 

(3)  D(yt)  =  -B-i[D(r)zt  +  D(B)yt], 
where 

#t  =  -B-iht. 

Except  for  the  possibility  that  errors  in  the  .T-matrix,  D[T), 
compensate  errors  in  the  B-matrix,  D(B),  these  original  errors 
will  be  reflected  in  errors  in  the  vector,  D  (yt)  after  multiplication 
by  B"1.  In  some  problems  B~x  can  be  a  large  factor,  as  will  be 
illustrated  below  in  a  simple  example. 

Consider,  for  illustrative  purposes,  the  simple  multiplier  model 

(4)  Ct  =  ocY,  +  up 

(5)  Yt  =  Ct+It. 

Ct  =  consumption  (endogenous) 
Yt  =  income  (endogenous) 

It  =  investment       (exogenous) 
ut  =  random  disturbance. 

The  unbiased  estimate  of  the  marginal  propensity  to  consume 
is  a,  and  the  biased  least-squares  estimate  is  a  +  e.5)  The  bias, 
when  looked  at  from  a  partial  point  of  view  is  simply  e,  but  when 
considered  from  the  more  general  point  of  view  of  the  whole 
system  it  is  e/l  —  a  —  e,  as  can  be  seen  from 

5)  Haavelmo  derives  an  explicit  expression  for  the  bias  in  this  model  and 
finds  it  positive  under  fairly  general  conditions.  T.  Haavelmo.  "Methods  of 
Measuring  the  Marginal  Propensity  to  Consume,"  Journal  of  the  American 
Statistical  Association,  Vol.  42   (1947),  pp.   105 — 122. 
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Ct=(a  +  *)(Ct+It) 

a-\-e  r    a  e  1 

W  C>  =  Y^^e1*  =  Ll^  +  (i_«)(i_a_«)J  J< 

= r«  +•— • — l  tt, 

L         1  —  a  —  e_\ 


where 


1  —  # 


In  the  reduced  form,  or  multiplier,  equation  the  bias  is  mag- 
nified by  the  factor  1/1  —  a  —  e,  which  will  be  of  the  order  of 
magnitude  of  the  multiplier.6)  Another  way  of  looking  of  the 
matter  is  to  observe  that  the  percentage  bias  is  smaller  in 
absolute  value  in  the  structural  equation  than  in  the  reduced 
form  equation. 


< 


a  (I  —  a  —  e) 


as  long  as  |1  —  a  —  e\  <  1. 


In  this  simple  case,  it  is  clearly  seen  that  a  discrepancy  in 
the  structural  equation  e,  is  magnified  by  the  multiplier, 
1/1  —  a  —  e.  The  multiplier  plays  the  role  of  B~x  in  the  more 
general  formulation  (3),  showing  the  possibility  of  comparatively 
small  discrepancies  becoming  comparatively  large  in  the  final 
result. 

Least-Squares  Efficiency — Structural  Equation  and  Reduced  Form 

This  simple  model  of  the  multiplier  process  is  useful  in  providing 
an  example  to  show  that  Theil's  proposition  about  the  efficiency 
of  least-squares  methods  cannot  be  extended  to  reduced  form 
parameters.  The  efficiency  properties  of  least-squares  estimation 
of  the  parameter  in  the  structural  equation  are  not  preserved 
under  transformation  to  the  reduced  form,  just  as  we  know  that 
the  point  values  of  least-squares  estimates  vary  with  the  direction 
of  minimization  of  squared  residuals. 

The  variance  of  the  direct  least-squares  estimate  of  the  marginal 

6)  Haavelmo,  op.  cit.,  shows  numerically  how  a  difference  of  .06  in  the 
marginal  propensity  to  consume  becomes  a  difference  of  .68  in  the  multiplier. 
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propensity  to  consume  is  given  by 

var  («  +  «)=  var  =  var ^ 

=  var^+^rj  =  var-^T. 

Applying  a  first  order  approximation  formula  of  a  function 
of  random  variables  to  the  right  hand  expression  we  get  the 
classical  type  result.7) 

var  (u) 

in  the  limit  as  the  sample  size  ->  oo.  In  deriving  this  result  the 
standard  independence  assumptions  are  used  about  ut  and  Yt 
for  unequal  subscript  values.  Thus  for  large  samples  the  classical 
formula  is  used  even  though  ut  and  Yt  are  not  independent  (for 
.equal  subscript  values). 

In  forecasting  Yt  from  estimates  of  the  reduced  form  equation 

1  1 

Yt  =  - It-h- ut, 

1  —  a  1  —  a 

we  are  interested  in  the  variance  of  the  estimated  multiplier 

1 

est. 


<7)  var  (a  +  e)  = 


1  —  a 

The  least-squares  estimate  of  the  structural  equation  leads  to 
an  estimate  of  the  multiplier  as 

1  1 

l.s.est. 


1  —  a       1  —  a  —  e 
while  the  corresponding  unbiased  estimate  will  be  written  as 

m.l.est. = (m.l.  =  maximum  likelihood). 

1  —  a       1  —  a 

We  approximate  the  variance  of  the  least-squares  estimate  as 

7)  This  is,  of  course,  the  same  result  that  Theil  derives  by  a  different  method 
in  his  comparison  of  the  efficiency  of  least-squares  and  limited-information 
estimates.  He  deals  with  the  estimated  variance  determined  from  the  sample 
.observations. 
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1  var  (a  -f-  e) 

var- 


(t^-J™{u) 


,   -  a  —  e      (1  —  a  -  «)4  (1  -  a  -  tf)22Y* 

(i — - — V  var  («)(i;y^ 

\1  —  a  —  0/  var  (w)  ZY  t 


(ZY*-ZCtYtyZYl  (1  -  a  -  «)«  (£/«  Y«)» 

The  maximum-likelihood   estimator  is  simply  that  multiplier 
value  calculated  from  the  least-squares  regression  of  Yt  on  It, 

I     l     V 

/in    lr^)var(M) 
(9)  var(r^-J  = — m — • 

With  a  positive  bias,  e  >  0,  we  have  the  inequality 

var  u  ,.  var  (u) 

<plim- 


(1  -  a)2  -  ^         (1 

since  plim  a  =  a, 

and  we  assume  0<1  —  a  —  e  <  1. 

Also 

1  ZY 

< 


hence 


var 


I I  5^  plim  var  I ) . 

\1  —  a!  \\  —  a  —  e) 


Thus  our  example  shows  clearly  that  the  efficiency  properties 
of  least-squares  structural  estimates  do  not  always  carry  over 
to  the  reduced  form  equations.  This  example  is,  of  course, 
special  as  well  as  simple.  The  maximum-likelihood  estimate 
of  the  reduced  form  equation  is  simultaneously,  or  can  be  trans- 
formed into,  a  least-squares  estimate  of  the  reduced  form,  a 
full-information-maximum-Hkelihood  estimate  of  the  whole 
system,  and  a  Hmited-information-maximum-Ukehhood  estimate 
of  the  consumption  function.  All  three  of  these  identical  estimates 
are  therefore  superior  to  least-squares  structural  estimates,  in  a 
sense,  for  this  model. 
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If  a  model  is  exactly  identified,  there  exists  a  set  of  least- 
squares  estimates  that  provide  efficient  estimates  of  the  reduced 
forms.  These  are  also  maximum-likelihood  estimates  of  the 
reduced  form  and  can  be  transformed  into  maximum-likelihood 
estimates  of  the  structural  parameters.  They  are,  however,  very 
particular  least-squares  estimates  and  not  those  that  are  custom- 
arily obtained  from  the  separate  treatment  of  each  structural 
equation.  The  least-squares  estimates  of  the  structural  equation, 
in  the  form  usually  made,  do  not  in  general  preserve  their  effi- 
ciency properties  under  transformation  to  reduced  form  para- 
meters. To  put  this  in  concrete  terms  of  the  simple  model 
considered  in  this  section,  we  note  that  the  least-squares  estimate 
of  the  marginal  propensity  to  consume  does  not  provide  an 
efficient  estimate  of  the  multiplier  (although  a  least-squares 
estimate  of  the  multiplier  would,  in  fact,  be  efficient).  In  over- 
identified  systems,  the  lack  of  invariance  of  the  efficiency 
property  under  transformation  proves  to  be  more  important. 

In  the  next  section,  we  shall  compare  efficiency  of  least- 
squares  and  full-  and  limited-information-maximum-Hkelihood 
estimates  of  the  reduced  forms,  in  more  general  cases,  including 
overidentification.  In  another  section,  however,  we  shall  present 
some  results  of  a  sampling  experiment  with  small  samples  and 
overindentification,  which  show  clearly  the  failure  of  least- 
squares  estimates  to  retain  efficiency  under  transformation. 

The  Efficiency  of  A  Priori  Restrictions 

It  has  frequently  been  remarked  that  by  imposing  a  priori 
restrictions  on  an  economic  model  we  gain  efficiency  of  estimation 
because  more  information  is  brought  to  bear  on  the  problem 
than  is  the  case  in  the  absence  of  such  restrictions.8)  It  is  the 
purpose  of  this  section  to  give  a  formal  proof  of  this  point  of 
view  and  to  interpret  it. 

Liu  has  posed  the  seemingly  paradoxical  proposition:  Least- 
squares  estimates  of  the  reduced  form  parameters  in  linear 
systems,  ignoring  all  a  priori  restrictions,  lead  to  a  higher  point 
on  the  likelihood  function,  assuming  normally  distributed  disturb- 

8)   See  e.g.,  T.  C.  Koopmans  and  Wm.  C.  Hood,  op.  cit.,  esp.  p.    176. 
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ances,  than  do  restricted  maximum-likelihood  estimates; 
therefore  the  unrestricted  estimates  are  to  be  preferred.9)  Put 
in  another  way,  he  observes  that  squared  discrepancies  between 
predicted  and  actual  values  of  endogenous  variables  in  a  linear 
model  will  be  smaller  over  the  sample  period  if  calculated  from 
(unrestricted)  least-squares  estimates  of  the  reduced  form 
equations  than  if  calculated  from  any  set  of  estimates  of  structural 
parameters  using  a  priori  restrictions.10)  If  the  least-squares 
values  of  the  reduced  forms  give  better  "predictions"  over  the 
sample  period,  should  we  not  expect  them  to  give  better  predic- 
tions outside  the  sample? 

While  the  least-squares  estimates  of  the  reduced  forms  are 
consistent  and  while  they  lead  to  minimal  squared  residuals; 
they  do  not  lead  to  efficient  estimates  of  the  reduced  form 
parameters.  We  shall  now  turn  to  the  proof  of  the  proposition 
that  the  more  one  uses  valid  information  in  the  form  of  a  priori 
restrictions  imposed  on  the  system,  the  more  efficient  are  the 
estimates. 

Let  us  write  a  linear  model  as 

(1)  Byt  +  nt  =  ut, 
with  reduced  form 

(2)  yt=-B-irzt  +  B-iuP 

yt,  zt  and  ut  are  column  vectors  with  n,  m,  and  n  components, 
respectively.  B  is  a  square,  nonsingular,  matrix  of  order  nxn, 
while  r  is  rectangular  of  order  nxm. 

We  can  also  write  the  reduced  form  equation  as 

(10)  yt  =  nzt  +  vt> 

obscuring  the  relation  between  its  coefficients  and  those  of  the 

9)  T.  C.  Liu,  "A  Simple  Forecasting  Model  for  the  U.  S.  Economy,"  Inter- 
national Monetary  Fund  Staff  Papers,  IV  (August,  1955)  464 — 66,  Liu's  ar- 
gument is  actually  more  in  terms  of  a  supposed  inherent  tendency  towards 
underidentification  in  models  of  an  economic  system  than  in  terms  of  the 
point  reached  on  the  likelihood  function  with  and  without  restriction.  He  uses 
the  latter  point,  however,  in  justification  of  his  claim  of  lack  of  identifiability. 
His  general  conclusion  is  that  we  can  do  no  better  than  to  make  unrestricted 
least-squares  estimates  of  the  reduced  forms. 

10)  In  the  case  of  exact  identification,  least  squares  estimates  of  the  reduced 
form  parameters  will  coincide  with  fully  restricted  maximum  likelihood  estimates. 
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structural  equations.  On  multiplying  both  sides  of  this  equation 
by  B,  however,  we  find 

(11)  Bn=-r, 

by  equating  coefficients  of  like  variables  in  the  reduced  form  and 
structural  equations.  Some  elements  of  r  are  zero;  therefore 
BII  has  the  same  zero  elements.  These  are  the  restrictions. 
Let  us  write,  symbolically, 

(12)  (BII)r  =  0 

to  show  that  r  elements  of  r  are  zero,  and  that  corresponding 
elements  of  BII  are  zero. 

The  logarithm  of  the  likelihood  function  of  the  entire  system 
can  be  written  as 


(13)        L  =  const.  -  -log  \QV\  -\^v\Q-xvt  +  A' (£77), 


<=i 


Qv  =  matrix  of  variances  and  covariance  of  elements  of  vt. 
X  is  a  vector  of  Lagrange  multipliers  with  r  non-zero  elements. 
If  there  were  no  restrictions  on  the  system,  variances  and 
co variances  of  est.  77  are  given  by 


(14) 


d2L 


GijM. 


o^MZ1 


aij  ==  typical  element  of  Q~x. 

a..  =  typical  element  of  Qv. 

Mzz  =  moment  matrix  of  predetermined  variables,  zt. 

The  elements  of  this  inverse  matrix  (14)  are  the  variances 
and  covariances  of  unrestricted  least  squares  estimates  of  77. 
For  any  particular  equation  in  this  complete  set  of  reduced 
forms,  the  appropriate  variance-co variance  matrix  is  a^M^. 
To  determine  the  variance  of  forecast  for  any  endogenous 
variable,  outside  the  sample  values,  we  have,  by  HoteUing's 
formulation  of  the  problem  of  prediction  error, 

(15)  S\  =  ai4(l  +  zFM-}zp)t 

where  zF  denotes  a  vector  of  values  assigned  to  predetermined 
variables  in  the  forecast  period. 
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Suppose  now  that  we  have  r  restrictions  on  the  maximization 
of  the  likelihood  function,  then  the  variances  and  covariances 
of  est.  IT  are  given  by  the  N.W.  principal  minor,  A,  on  the 
right  hand  side  of  the  following  expression 


(16) 


d2L 

A 

-l 

A 

A 

A 

dnik  dnn 

A' 

0 

A' 

0 

The  bordering  matrix  A  consists  of  elements  of  X  obtained  as 
—  d2Lld7zijdbkl,  where  bkl  are  elements  of  B. 

Theorem:  Consider  an  mxm  principal  minor  of  A  correspond- 
ing to  the  *-th  reduced  form  equation.  Denote  this  principal 
minor  by  Aip. 
It  follows  that 

zpAivzF  <:  gu  z'FM~lzF. 

Proof:  Denote  a  vector  of  mn  elements  by  w. 


(17) 


Let  det. 


d2L 

dnikd7tn 

X 

X 

0 

=  A, 


where  A  is  a  vector. 
Form  the  difference 


2   Aw^WiW, 

i,3  =  l 


2    AHWiWi 


'00 


Zl00  is  formed  from  A  by  deleting  the  last  row  and  column. 
A00ij  is  formed  from  A00  by  deleting  the  f-th  row  and /-th  column. 

mn  mn 

A    2  A00Jjwiwj  -  A00  2  AuU>iU>i 


AA 


oo 


By  Jacobi's  theorem  on  determinants11)  we  have 


u)  See  e.g.,  A.  C.  Aitken,  Determinants  and  Matrices  (London:  Oliver  and 
Boyd,   1942),  pp.  98—99. 


-AA00  >  C 

. 

d2L 

d7iihdnn 

=  ^00 
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therefore 

mn  I  mn  \  2 

2  A0iAiiiwiwj  I  2  ^oi^l 

(18)  d  =  -  ^ =  -  — >  0. 

zlzl00  zlzl00 

The  difference,  d,  is  positive  since  A  00  is  a  positive  definite  quad- 
ratic form.  Determinants  of  positive  definite  quadratic  forms 
obtained  by  successive  rows  and  columns  of  bordering  alternate 
in  sign;  hence 

Thus  by  bordering 

(19)  det. 

with  one  row  and  column,  we  find 

(20)  w'(A-%0w^w'(Am)-*w. 

By  bordering  A  with  a  row  and  column  to  form  (n^J),  we  similarly 
find 

(21)  ^'[(n^Joo.oo^  ^  W{A-i)nW, 

and  so  on  for  successive  borderings.  By  letting  all  elements  of  w 
vanish  except  the  m  elements  in  zF  corresponding  to  the  mxm 
principal  minor  of  A  associated  with  z'-th  reduced  form  equation, 
we  have 

(22)  zFAivzp  ^  ati z'p M"1  zF, 

as  was  to  be  proved. 

The  total  variance  of  forecast  from  a  system  of  structural 
equations  is  composed  of  two  factors,  the  variance  of  disturbances 
au  and  a  quadratic  form  (plus  unity)  in  the  assumed  values  of 
the  predetermined  variables.  The  matrix  of  this  quadratic  form 
when  multiplied  by  gu  is  the  variance-covariance  matrix  of 
reduced  form  parameter  estimates.  By  adding  information  to 
the  system  in  the  form  of  restrictions  on  the  parameters,  we 
decrease  the  magnitude  of  this  quadratic  form.  The  estimated 
value   of  forecast  error  will  reflect   differences  in  estimation 
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procedure,  the  calculated  variance  of  residuals  being  lowest 
when  no  restrictions  are  used.  Nevertheless,  the  underlying  or 
inherent  error  using  the  true  value  of  au  in  the  formula  for 
prediction  error  will  be  smaller,  the  more  one  employs  a  priori 
restrictions  in  calculating  the  structural  characteristics. 

The  succession  of  inequalities  derived  shows  that  full-infor- 
mation-maximum-likelihood estimates  will  be  more  efficient 
than  Hmited-information-maximum-likehhood  estimates  since 
more  restrictions  are  used  with  the  former  than  with  the  latter 
set.  Both  of  these  methods,  in  turn,  will  be  more  efficient  than 
unrestricted  least-squares  estimates  of  the  reduced  forms. 

In  connection  with  limited-information  estimates  a  word  of 
explanation  is  in  order.  When  any  single  equation  in  a  system 
is  being  estimated,  this  method  will  produce  a  set  of  reduced 
form  estimates,  separately  involving  each  of  the  endogenous 
variables  in  that  equation.  In  so  far  as  a  single  endogenous 
variable  appears  in  several  different  structural  equations,  there 
will  be  several  possible  reduced  form  equations  that  could  serve 
as  alternative  forecasting  equations.  The  best  among  these  will 
be  that  set  which  yields  the  smallest  quadratic  form 

zFAipzp. 

This  will  be  inferior  to  the  full-information-maximum-Ukehhood 
estimates.  Another  way  of  using  limited-information  estimates 
in  practical  forecasting,  is  by  solving  algebraically  for  reduced 
form  equations  in  a  structural  system,  each  equation  of  which 
has  been  estimated  by  the  method  of  limited  information.12) 
The  foregoing  results  on  the  efficiency  of  the  use  of  a  priori 
information  follow  closely  a  development  by  Samuelson  called 
"the  generalized  Le  Chatelier  principle/'13)  Samuelson  shows 
that  the  sensitivity  of  an  economic  variable  to  a  parameter 
change,  e.g.,  price  elasticity  of  demand,  decreases  as  the  number 
of  restraints  imposed  upon  the  system  increases.  He  proves 
this  proposition  by  bordering  a  matrix  of  second  derivatives  of 
a  function  being  maximized.  In  a  sense,  this  paper  extends  his 

12)  L.  R.  Klein  and  A.  S.  Goldberger,  An  Econometric  Model  of  the  United 
■States,  1929 — 1952  (Amsterdam:  North-Holland  Publishing  Co.,  1955). 

13)  P.  A.  Samuelson,  Foundations  of  Economic  Analysis  (Cambridge:  Harvard 
University  Press,   1947),  pp.  36—39). 


EFFICIENCY  OF  ESTIMATION  IN  ECONOMIC  MODELS  229 

result  from  inequalities  on  diagonal  elements  of  inverse  bordered 
matrices  to  quadratic  forms  of  principal  minors.14)  If  we  were 
interested  solely  in  variances  of  individual  coefficients  in  the 
reduced  form,  his  result  would  be  directly  applicable.  Since  the 
forecast  error  involves  an  entire  quadratic  form,  his  proposition 
must  be  extended.  It  may  also  be  remarked  that  Samuelson's 
proposition  is  not  a  perfect  analogue  for  ours.  His  bordered 
matrices  are  those  familiar  in  establishing  conditions  for  ex- 
tremes, while  ours  would  have  to  be  bordered  further  to  take 
that  form. 

Some  Experimental  Sampling  Results 

The  argument  so  far  has  been  based  on  asymptotic  theory. 
In  a  constructed  experiment  with  repeated  small  samples, 
G.  W.  Ladd  has  produced  some  extremely  interesting  findings 
relevant  to  the  problems  of  this  paper.15)  His  model  consists 

of  two  overidentified  (demand  and  supply)  equations. 

(23)  ylt  =  p12y2t  +  yxlzlt  +  y12z2t  +  ult 

Vlt  =  P 22  Vlt  ~f"   723  ZZt  ~+~   724  ZM  "T"  U2f 

He  assigned  specific  population  values  to  the  structural  coef- 
ficients and  drew  random  numbers  for  the  disturbances  and  the 
exogenous  variables.  The  endogenous  variables,  ylt  and  y2i, 
were  then  computed  from  the  structural  equations  in  (23). 
Ladd  was  particularly  interested  in  the  effects  of  errors  of 
observation;  therefore  he  added  to  each  of  the  endogenous  and 
exogenous  variables  an  error  of  observation. 

Thirty  samples  of  thirty  observations  each  were  obtained  by 
drawing  the  six  sets  of  observation  errors  repeatedly,  while 
the  disturbances  uiti  are  not  changed  from  sample  to  sample. 
The  errors  in  ylt  are,  together  with  linear  functions  of  uw  values 
of  reduced  form  disturbances;  thus  the  model  differs  from  that 
treated  above  by  having  observation  errors  in  the  other  variables. 
For  each  sample,  Ladd  computed  from  the  observations  on  the 

14)  Samuelson's  demonstration  in  the  first  printing  contains  some  minor 
compensating  errors,  but  the  final  results  are  quite  correct. 

15)  G.  W.  Ladd,  "Effects  of  Shocks  and  Errors  in  Estimation:  An  Empirical 
Comparison,"  Journal  of  Farm  Economics,  XXXVIII  (1956),  pp.  485 — 95. 
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y's  and  z's,  estimates  of  the  structural  parameters  by  the  method 
of  least  squares  and  the  method  of  limited  information.16)  From 
each  of  these  two  types  of  estimates  we  can  derive,  alge- 
braically, reduced  form  estimates  in  each  sample. 

If  we  denote  least-squares  estimates  by  a  ^  and  limited 
information  estimates  by  a  a  sign,  we  can  derive  two  possible 
reduced  forms  for  each  sample, 


Vit  =  7n 


(i  +  r^yk  +  Ui  +  r^v) 

V  P22  —  Pl2/  ^  P22  —  Pl27 


tc>A\  ~  P12  ~  P12 

(24)  -723  3       — g-  Ht  -  724   3 3~  *4* 

P22  Pl2  P22  Pl2 

P22~~Pl2  P22  — Pl2  ?22~P\2  R22~Pl2 

or  the  same  equations  with  yn  replaced  by  yn,  y12  by  p12,  etc. 
Alternatively,  without  taking  any  of  the  identifying  restric- 
tions  into   account,   we   can   postulate   general   reduced   form 
equations 

(25) 


Vit  =  Puht  +  £12*2*  +  fiisht  +  Puht 

V%t  =  ^21*1*  +  ^22*2*  +  p23Z3t  +  p2iZit 

whose  coefficients  are  estimated  from  the  least-squares  regres- 
sions of  ylt  and  y2t  on  zlt,  z2t,  zzt  and  zu.  These  regressions  are 
obtained  as  a  by-  product  of  the  limited-information  estimates. 
The  accompanying  table  gives  the  results  of  our  computations 
from  Ladd's  data.  For  the  three  methods  of  estimating  the 
reduced  forms,  I  have  calculated  the  mean  and  standard  devia- 
tion of  the  thirty  sample  values  of  each  parameter.  The  com- 
putations in  the  table  assume,  in  effect,  that  there  are  thirty 
independent  samples;  however,  the  fact  that  the  vectors 
(***i>  ui2>  •  •  •  ui,m)  are  unchanged  in  repeated  samples,  means 
that  the  composite  reduced  form  disturbance,  consisting  of  a 
linear  function  of  uit  and  observation  error,  contains  a  common 
element  in  each  sample.  Since  the  variance  of  this  linear  function 
of  uit  is  not  small  relative  to  the  variance  of  observation  error 
there  is  significant  correlation  in  the  random  terms  from  sample 

16)  In  the  least-squares  regressions,  ylt  is  selected  as  the  dependent  variable. 
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to  sample.  In  comparing  the  different  methods  of  estimation 
we  do  not  have  as  much  information  as  would  be  given  by- 
thirty  independent  samples. 


Table 

Sample  Means  and  Standard  Deviations  of  Estimates  of  Reduced  Form  Parameters 
Obtained  by  Three  Methods 


1.  Reduced  form  coefficients  derived  from  least-squares 

estimates  of  structural 

parameters. 

Population 

Sample 

Sample  standard 

parameter 

mean 

deviation 

Pa 

.05 

.011 

.017 

P12. 

.27 

.094 

.083 

Pl3 

.06 

.116 

.040 

Pu 

.11 

.127 

.036 

P21 

-.16 

-.295 

.280 

P22 

-.90 

-2.054 

.360 

P23 

.30 

.670 

.224 

P24, 

.56 

.770 

.323 

2.  Reduced  J 

Form 

coefficients  derived 

from  limited-information  estimates  of 

structural 

parameters. 

Pu 

.05 

.045 

.046 

P12 

.27 

.254 

.042 

P13 

.06 

.037 

.026 

Pu 

.11 

.123 

.046 

P21 

-.16 

-.127 

.118 

P22 

-.90 

-.957 

.132 

P23 

.30 

.175 

.089 

p2i 

.56 

.559 

.129 

3.  Reduced  form  coefficients  estimated 

directly  from  least 

-squares  regressions 

of  yu  and 

2/2* 

on  zlt    z2t,  z3t, 

and  z 

it- 

Pu 

.05 

.028 

.094 

P12 

.27 

.258 

.049 

Piz 

.06 

.044 

.071 

Pu 

.11 

.121 

.048 

P21 

-.16 

-.027 

.314 

P22 

-.90 

-.951 

.138 

P23 

.30 

.091 

.242 

P2i 

.56 

.576 

.151 

In  comparing  the  first  two  methods,  least-squares  and  limited 
information  structural  estimates,  the  bias  of  the  former  is  clearly 
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shown.16)  In  all  cases,  the  mean  of  the  Hmited-information 
estimate  is  closer  than  the  mean  of  the  least-squares  estimate  to 
the  population  value.  In  six  of  eight  cases,  the  sample  standard 
deviation  of  parameter  estimates  is  lower  for  Hmited-information 
than  for  least-squares  estimates.  This  is  a  reflection  of  the 
proposition  that,  in  general,  the  optimal  properties  of  least- 
squares  estimates  are  not  preserved  under  transformation.17) 

In  comparing  the  last  two  methods,  it  can  be  seen  that 
neither  shows  much  bias,  which  is  as  expected,  but  that  Hmited- 
information  estimates  have  uniformly  smaller  variance.  This 
is  a  smaU  sample  exposition  of  the  results  derived  in  the  preceding 
section. 

Ladd's  results  are  extremely  suggestive  for  our  purposes  though 
not  conclusive.  Of  course,  results  from  constructed  experiments 
do  not  constitute  general  proof  of  investigated  propositions, 
but  Ladd's  experiment  is  far  less  than  perfectly  designed  for 
the  particular  objectives  of  this  paper.  It  would  be  desirable  to 
carry  through  similar  experiments  without  the  complication  of 
observation  error.  Even  within  the  framework  of  Ladd's  ex- 
perimental design,  there  are  changes  that  could  be  made  to 
strengthen  the  present  inquiry.  Both  least-square  structural 
regressions  were  made  with  ylt  as  the  dependent  variable,  but 
other  choices  deserve  to  be  investigated  in  the  present  context. 
This  is  especially  true  since  the  population  correlation  between 
u2t  and  y2t  is  as  high  as  0.56.  It  is  the  correlation  between 
disturbances  and  independent  variables  in  regression  analysis 
that  introduces  bias.  The  results  in  the  table  are  also  restrictive 
in  that  they  deal  only  with  variances  and  neglect  co variance. 
A  single  statistic  constructed  along  the  lines  of  the  standard 
error  of  forecast  would  be  more  adequate.  In  a  future  sampHng 
inquiry  some  of  these  deficiencies  can  be  remedied. 


ie)  Limited  information  estimates  are  consistent  but  not  generally  unbiased. 
In  small  samples  of  the  type  under  consideration,  bias  may  be  revealed  in  them 
as  well.  Here  the  argument  is  simply  that  limited  information  estimates  reveal, 
in  the  sampling  experiment,  a  smaller  degree  of  bias. 

17)  The  standard  deviations  are  computed  about  sample  means.  If  they  were 
computed  about  population  means,  the  least-squares  results  would  compare 
even  less  favorably. 
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